Gene Information |
Locus tag | Snas_0853 |
Symbol | |
ID | 8882037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 902251 |
End bp | 905283 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509657 |
Protein GI | 291298379 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
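The coordinate fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 902251
end_bp = 905283

# Inclusive coordinates: both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 3033 1010
```

Both values agree with the Gene Length (3033 bp) and Protein Length (1010 aa) fields.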
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTATC GGATTCTGGG GCCGCTGGAG GTCGTCCGGG ACGGGGTTCC GATCGCCATC AAGGGGCGGC ACCAACCGCG CCTGCTCGCG ATGCTGCTAC TGGAGGCGGG CCAGACGGTG ACGCTTTCGC GGCTGGTGGA CGTGCTGTGG GACGAAGAAC CCCCGGAGAC GGCCCGGCGC CAAGCGCAGA ACTACATGGC GGCGTTGCGC AGAACCCTCG GCACGAGCAA CCCGATCGAG GTGGTGGGGG AGGGATACCG GTTCGCGGCT GGGGACAGCT TTGTGGACTC CGTTCGGTTC GAGGAGCTGA ACCACCTGGC GCGTCACGAC GCACGCGAGG GCAGACATGC CAAGGCGTTG CCGAGCTTCG ACGAGGCACT GGGCCTGTGG CGAGGACCGA CCCTGGCGGG GTTGTCGGCG CGGTGGCTGG AATCGCGGTC AAGGCGACTC GACGATCTCC GCCTCGGCAT GATCGAGGAC CGCGCGGCGA GCCTCCTCGA ACTGGGGCGA CACGCGGACG TGGTGGGGGA GCTGGCGGAA CTGTTGGCGG AGAAGCCGTA TCGACAGCAG GCGGCACGAC ATCTGATGCT GGCGCTGTAT CGGTGTGGCA GGGGTGCCGA GGCGCTGGAG GTCTTTGGGG CACTACGGAC GCGGTTGGCG GAGGAACTGG GGATCGACCC GAATCCGATG GTGCGGGAGC TGCATGGGCG GATCCTGCGG GAGGACGCGT CGCTGGCGGT GCCGCGGCCA GAGATGTCAA TCGTTGCGGC TAAGCCAGAG ATTCTGGTGC CCGCACAGCT TCCGGCGGGC GTGGCGACGT TCACCGGCCG GGATGAAGAG CTGGCCGAGT TGGACAGGCT GTTCGACTCC GGAGCCGCTG TCACCGTATT GTCGGCGATC TCCGGTGGTG GTGGCGTGGG AAAGACTGCG TTGGCGGTCC ACTGGAGTCG GAATCGGACC GAAAGGTTCC CTGACGGACA GCTCTACGTC AACCTGCGCG GTTTTGACCA CAATGAACCG TTGAAGCCCA TTGACGCGCT ATCACGATTC CTTCGTGCGC TTGGGACTCC GAGCGCCAAA ATTCCGGCAG AGACCGAGGA GGCGTCGGCG CTGTTTCGCT CGGTGATGAA CGGCCGGAAC ATGCTGGTGG TCTTGGACAA CGCCCGAACT GCTGAGCAGG TGCGACCGTT GTTGCCGGGC GGCCAGGACA ACGCCGTCCT GGTGACGAGC CGCAATCGGC TGGCGAGCCT GGCGGCGCTC AATGACGCCA AGCTCATGGC TTTGGATGTG CTCAGCCTTA CGGAGTCGTT GGAGCTACTG GCCGAGCTGA TTGGGGCTGA TCGGGTTAAC GCGGATCCGG ATTCTGCCCG TCGGCTGGTC GAATTGTGCG GTCATCTTCC GCTGGCATTG CGGATCGCGG CCGCTAGCCT GGCGGCAAGG TCCGATGGCT CGATATCCAA TTTGGCGTCC GAACTTGACA GCGTTTCTCG GCTGGAGATC CTCAGCATCG AGGGCGACCC GTACTCAGCT GTTACGGCGA CGTTTGACCT ATCAGTCGGC GCTTTGAGCA TTGAGGCCCG GGATCTGTTC CTGCGCCTGG GGATGATTCC GGGTGAGGAC ATTGCGGAAG GACTGGCCGT TGAGGTTTCG GGCCACCAGG AGGATCGAGC CAAGGACCTG TTGCAGGGTC TGGTTTCTGC ACACTTGCTC GAGACGCATG TTCCAGGGAG GTACCGCTTC CATGACCTGG TGAGAATCTA TGCGCATCGT TGCGCAATTG ACAAGTTGAG TAGCTCAATT 
TGTGAATCGA TTATCGATAC GCTCGTCGCT TGGTACTTCC AGAATCGTGG TCGAATCAGT CCCGATGAAA TTCCAAATGT GGTTACCACC TGCAAGTTGC TGAACAAACA TCCGGAGATG TGGCGGGTGG CATACACCTT GAGCATGATC ACGCACCACG GGCTCGGTTT GAAAGTCGCC CGTGACCAAT GTGAGATCGC TTTGAGGATC GCTGAGGTGT ATAACGACAG GCATGGCCAA GCACGCATGC TGTCTGCACT TGGGGGGATT CAATATGCAC TAGGGGAAAC GGAGCTTTCC ATTGAATTGT CCCGGAAATC CGTAGCCACA ATAGATCCTA ATGTGGACTC ACGGACATAC GGCGACTTGT GTAACAACCT CGGAATTCGG CTTAGTTGGA GCGGTCGGTA TCGGGAAGCC GAGGAGTGGC TGGTGAAGTC ATTGCAGCCG GGTTATATGG CTGAAGGCGT TCATTTTGAA CTGGTGCGCG TGCTTAATTT GGGAACTGTG TACAGGGCGT TGGGGCGTTA TGGTGATGCA TTGGCATACC TGGATCGAAG TGCGGCACTT TTGTCGAAGC TCGAAAGCCG ATCAATCGCC GCGAGCATTT ATCTCGCCAG AGCGCTATTG GAGAACGATC GAGGTCGCCA CCGTAAAGCT TGGGCCTATG CCCAGGAGGC ATTGCTTGCA GGTCAGCAGT GCAAGTCTGT TCGTTCGATA GTGATCGCGC TCCAGCAACG AGGTTTGGCA GAGTTTGGGT GCGGGAAAAT CGAGCGAGCG CGTGAGGACT ATCTTGCGTC TACAGTACGA GCAAATGAAC ACGGAATGTC TACAGTAGAG CGTCGCAACT ACTGTTTTCT CGCGGACCTC GAATGTTTCG CTGGCGACAA TAGTCGAGCA GTGAAGTATC TGGAGTCAGC AGCAGCAGTG GAAGTCTTGC AACCTCCTAG ATACCTGGTT GCAGAGATGC TGCGCGTGGA CAGCAACGTT CGTTTGAAAC TGAACGCGTA TCGGCATGCT GTAAGTACGG GACGCAAGGC CGCAGCCTTG TTCGCCTCAA TGCCTGATCC GCTGCGCCAT GCGCGTTCGT TGGTGATAGT CGCTAGGGCA TATGAGGGCT TGGCCGACTA CAACAAAGCA GCAGACGCTC GCCAAGAAGC CCTAGCCATC TTCACTCGTC TGGGAGTCCC CGAAACCGCA GTGCTGCGGG CCGAAATCGA CGCTCACAGC TGA
|
Protein sequence | MEYRILGPLE VVRDGVPIAI KGRHQPRLLA MLLLEAGQTV TLSRLVDVLW DEEPPETARR QAQNYMAALR RTLGTSNPIE VVGEGYRFAA GDSFVDSVRF EELNHLARHD AREGRHAKAL PSFDEALGLW RGPTLAGLSA RWLESRSRRL DDLRLGMIED RAASLLELGR HADVVGELAE LLAEKPYRQQ AARHLMLALY RCGRGAEALE VFGALRTRLA EELGIDPNPM VRELHGRILR EDASLAVPRP EMSIVAAKPE ILVPAQLPAG VATFTGRDEE LAELDRLFDS GAAVTVLSAI SGGGGVGKTA LAVHWSRNRT ERFPDGQLYV NLRGFDHNEP LKPIDALSRF LRALGTPSAK IPAETEEASA LFRSVMNGRN MLVVLDNART AEQVRPLLPG GQDNAVLVTS RNRLASLAAL NDAKLMALDV LSLTESLELL AELIGADRVN ADPDSARRLV ELCGHLPLAL RIAAASLAAR SDGSISNLAS ELDSVSRLEI LSIEGDPYSA VTATFDLSVG ALSIEARDLF LRLGMIPGED IAEGLAVEVS GHQEDRAKDL LQGLVSAHLL ETHVPGRYRF HDLVRIYAHR CAIDKLSSSI CESIIDTLVA WYFQNRGRIS PDEIPNVVTT CKLLNKHPEM WRVAYTLSMI THHGLGLKVA RDQCEIALRI AEVYNDRHGQ ARMLSALGGI QYALGETELS IELSRKSVAT IDPNVDSRTY GDLCNNLGIR LSWSGRYREA EEWLVKSLQP GYMAEGVHFE LVRVLNLGTV YRALGRYGDA LAYLDRSAAL LSKLESRSIA ASIYLARALL ENDRGRHRKA WAYAQEALLA GQQCKSVRSI VIALQQRGLA EFGCGKIERA REDYLASTVR ANEHGMSTVE RRNYCFLADL ECFAGDNSRA VKYLESAAAV EVLQPPRYLV AEMLRVDSNV RLKLNAYRHA VSTGRKAAAL FASMPDPLRH ARSLVIVARA YEGLADYNKA ADARQEALAI FTRLGVPETA VLRAEIDAHS
|
| |
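The gene sequence starts with GTG while the protein sequence starts with M. That is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is among the permitted initiation codons and is read as methionine at the start position, although it encodes valine elsewhere. The sketch below illustrates this on the first 30 bases of the gene. The function name `translate`, the constants, and the set of start codons are assumptions written for this illustration and do not come from the record; the sketch uses only the Python standard library rather than a dedicated bioinformatics package.

```python
import itertools

# Amino acids for table 11, listed in the conventional codon order
# (bases T, C, A, G; third position varies fastest). "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed set of initiation codons for table 11; each is read as M
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("GTGGAGTATC GGATTCTGGG GCCGCTGGAG"))  # MEYRILGPLE
```

The output matches the first ten residues of the protein sequence in the record. A plain lookup without the start-codon rule would give V as the first residue instead.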