Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3989 |
Symbol | |
ID | 8885190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 4257990 |
End bp | 4261001 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512734 |
Protein GI | 291301456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0138619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.888144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTTCT CGATACTCGG ACCGTTGCGA GTCCAGCACG GCGGCCGTTC CGTCCCGATC TCCGGTCGCC ACGCTCCCAA GCTGCTTGCG GTATTGCTGG TCGACGCCGG GCGTCTCGTC ACCGTCACCC GTCTGATCGA AACGCTGTGG GCCGAGGATC CACCCGCCAC AGCGAAGCGA CAGGTCCAAA ACGTTATGGC GGCGTTGCGC CGCGCGTTGC CGGAACCGGA TGCCATCGAG GCGACCGGCA ACGGCTACCG ATTGGAGCTT GAATCGGCCA CTGTGGATCT GCGGGAGTTT GAGCGATTGC GGTCGCTCGC CCGAGCGGCC GAACCACGGG AGGCATTGCG ATATCTGCGA GAAGCGTTGC GACTGTGGCG GGGCGAGGCC CTCGCGGGCT TGACGGGGAG CAGCATCGAA GCGGCGGCGG CGAAGCTCAA CGAGGCGAGG ATCTCGGCCA TCGAGGAACG CGTCGAGCTC GAGCTGGACC TGGAACGACG TTTGGACGTC GTCGGCGAAC TGCGGGAGCT GCTGGTGGCG CACCCGTTCC GGCAACGGTT GACGGGACAA CTGATGCGGG CGCTTTGGTA CGCGGGGAGC AAGTCAGAGG CGTTGGAGGT CTACCGGAAA CTGCGGAAAC GATTGGCCGA GGGGCTGGGT CTGGATCCGG ACGCCGAACT GGCGCGGTTG CACTCGGCGA TCCTGCGGGG TGAGCCCGAA ACCCCCGCTG CCGACCGGAC CACAGCGGCG GTGCGGGAGC GGACGGTCCC GGCACAGCTG CCCGCCGCTC CGTCGACGTT CACGGGTCGG CGTAAGCAGG TGGTGGCGCT GGACGAGCTA CTGGAGCAGG GGCGGAACAC GGCGGTGGTG TCAGCGATCG CGGGGATGGG TGGTGCGGGG AAGACGGCGC TGGCGTTGTA TTGGGGGCAC CGGGTGCGGG AGCGGTTTCC AGACGGGCAG TTGTACATCA ACTTGCGCGG GTACGACGAA GCGAAACCGG TCGCCGCGAT CGACGCGCTG GGGCGGTTTC TGGTGGCACT GGGGCAAACC AGTACGACTG TCCCATCCGA TGTGGATGAG GCGGCGGCAT TGTTCCGGTC GCTGCTGTCC GAGCGGCGGA TGCTCGTCAT TCTCGACAAC GCCCGCGAGG CAGCCCAGGT CCGACCGCTG CTGCCGGGCG GCGCCGGGAA CCTGGCGATC GTGACCAGTC GGGATCGGCT GGCAAGCCTC ACCGCACTTG AAGGCGCTGA GCCGATTCGG CTGGATACCT TGAGCCAGAC GGAGTCACTG GAGTTGCTGG CGAACATCGT GGGAGCCGGG CGGCTGGACA CTGACCCGGA AGCGGCACAT CGGATCGCCG AGCTGTGCGG ACGGCTGCCG TTGGCCTTGC GGATCGCGGG GGCGAGCTTG GCGGCGCAGC CAGACCTGGC ACTCGGTGAG TTCACCGATG TACTGGGCGG GCCGGATCGG TTGCGGCGAT TGGCGCTTGA CGGGGACAAG CTCGCCAGCG TGTCCAACGT GCTCGAATTG TCCGTGGCAG CACTCGATGA CACGAGCCGC GAACTGCTGC TGAAGCTGGC GCAGATCCTG GGCGACGACT TCTGTCATGG ACTGGCGGTC CACCTGTCCG AACTGGACGA AACCGGGGCC GGGCGAAATC TGGCGGCCCT GGAGGCCGCA CATCTCATTG AGCAGCACAT CCCGTCCCGG TACCGTATTC ACGACTTGAC TCGCGATTAC ATGAGACAAC AGGGTCGCCG GACCTTCGAC GACGTTCGTC TTCACGACAT TCAAACCTGT TTCATCACTT GGCATTACGC TGTCCGGCGC GAAATTTCCG TCACAGAGGC ATCGAATGTC GTTTCCGCGT TCAACGCGTG GCGAGAACAT CCTGAGATCT GGAAACTCGC GACGATGTTC TCAGTCTTCA GCGGGACCGA GTACAAGCCC ACTCAGCTAC TTCAGTTGGC CGAGTGGGCA CTCGTCGCGA GAAAAGACGG CATGGATCCC ATGGGCAACG CGTATCTCCT CATGGAGATC GCGATTCTGT ATCGCGCCAT GGGAAACCGG AATCTGGCGG TCCGCGAAGC AGAGAAGGCC ATCGATGTCG TTCACCGAGC GGGTCTGGAG GATCCCGACG GGCGTTTCCG GGGGAATCTG GCATTGATGT ACATGGAGGT GGGCCAGTAC ATCCAAGCCG AGACGCTCAT GCGAGACGCG CTGCGATCGG CGCGAGAGTC AGGTGACGCC CAGAACATCA AGTCCTGTTC GTCGTCCCTG GCCGCCATCT GCCGCAGGTT GGGTAAGTTC GCCGACGCCG AGACACTACT CAACGGTGTC ATTGACAATC CGGAGCTGCC CACACAACCC ACTCTGGACA TTACCGCGAA GGCCCAACTA GGCGCCCTCT ATCTGGACAT CGGCCGCCTT ACCGAGGCTC TCACGGTCCT TGACGAGGTG CACTCGCTGC CGCCGGACGT GGGAGGAATG CGATCACGCA CGTTCTCGCG AATCCTGCGT ACTGAGGCTC TCTGCGCGCT GGGCCGGTAC GAATCCGCGC GTCCAGAGTT GACCGACGTA CTCGCTGTCG CCGTTCGCAT GGACCTCACC GGGGCGGTCA TGCTGGCTAC GATCCAGTTG GCGCACCTCC ACAGCGATTC GGGAGATCAA CAAGCCGCGC TACGAGCCCT GGACACACTC GGCCCCCATC ACCTCAACGA ATCCGACCAG AAATTCGCCG CTGAGATCGC GCGGCTGCGG TGCATCACCA ACACCCGGCT TCGACGGTTC GCGAAAGCCG TCACGTTCGG CGGTTACGCG TGTGACCGCT ACGCGAACAT GTCCTATCCG CTGATGCACG CCAGGTCGCT GGCGGCGCTG GCCAACGCGT ACGAGGGTGC AAACAATCCC GCCCAAACTA CGGCCTGTCG CGCGCAGGCC TTCGATATCT TCTCCCGGCT GGGCGTCCCC GAAGCGGACG AACTCCGCGA GCTCCTCGGC CCCGCCCCGT AA
|
Protein sequence | MEFSILGPLR VQHGGRSVPI SGRHAPKLLA VLLVDAGRLV TVTRLIETLW AEDPPATAKR QVQNVMAALR RALPEPDAIE ATGNGYRLEL ESATVDLREF ERLRSLARAA EPREALRYLR EALRLWRGEA LAGLTGSSIE AAAAKLNEAR ISAIEERVEL ELDLERRLDV VGELRELLVA HPFRQRLTGQ LMRALWYAGS KSEALEVYRK LRKRLAEGLG LDPDAELARL HSAILRGEPE TPAADRTTAA VRERTVPAQL PAAPSTFTGR RKQVVALDEL LEQGRNTAVV SAIAGMGGAG KTALALYWGH RVRERFPDGQ LYINLRGYDE AKPVAAIDAL GRFLVALGQT STTVPSDVDE AAALFRSLLS ERRMLVILDN AREAAQVRPL LPGGAGNLAI VTSRDRLASL TALEGAEPIR LDTLSQTESL ELLANIVGAG RLDTDPEAAH RIAELCGRLP LALRIAGASL AAQPDLALGE FTDVLGGPDR LRRLALDGDK LASVSNVLEL SVAALDDTSR ELLLKLAQIL GDDFCHGLAV HLSELDETGA GRNLAALEAA HLIEQHIPSR YRIHDLTRDY MRQQGRRTFD DVRLHDIQTC FITWHYAVRR EISVTEASNV VSAFNAWREH PEIWKLATMF SVFSGTEYKP TQLLQLAEWA LVARKDGMDP MGNAYLLMEI AILYRAMGNR NLAVREAEKA IDVVHRAGLE DPDGRFRGNL ALMYMEVGQY IQAETLMRDA LRSARESGDA QNIKSCSSSL AAICRRLGKF ADAETLLNGV IDNPELPTQP TLDITAKAQL GALYLDIGRL TEALTVLDEV HSLPPDVGGM RSRTFSRILR TEALCALGRY ESARPELTDV LAVAVRMDLT GAVMLATIQL AHLHSDSGDQ QAALRALDTL GPHHLNESDQ KFAAEIARLR CITNTRLRRF AKAVTFGGYA CDRYANMSYP LMHARSLAAL ANAYEGANNP AQTTACRAQA FDIFSRLGVP EADELRELLG PAP
|
| |