Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1973 |
Symbol | |
ID | 8883165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 2092183 |
End bp | 2095209 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003510762 |
Protein GI | 291299484 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.458427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGTCC GGCTCCTCGG TGGGGTGGAG GTCGCGGGCG CTGAGGGCCA GTGGCGGACG GTGCCCGGCA AACGCGCCGG TGTCCTGGCG GTCCTGGCCG TCGCGGCGGG CGAACCGGTG CCCGCCGACG AACTGGTGCA CCGGGTCTGG GGCGCCTCGG CCGTCGCTCT GGCCGACGGC GCGGTCTACC CGCACATCAC CCGGCTGCGG GCGACGCTGT CACCGGCCGG ACTCACCATC TCCCGGGCTC GCGGCGGCTA CGTCCTCGAC CTCGACGCCG ACCAGGTCGA TCTACTGACC ATACGGGCGC TGGCCGTGCG GGCCCGCGAG GCCGCCGAGA CGGGGGAGGA CGCGAAGGCG CTCACCGCCT GGCAACGGGC CACCGCGCTG TTGCGCGGCG AGGCGCTGGC CGGGGTCGAT GGCGACTGGG CCGAGCGGTT CCGGCAGAGC TTCGGCGGCG AGGCGCGGTC GCTGCTGGCC GAACGCTACG GCTGGGAGCT GCGCTGGGGC CGCCACCACG CCGTCGTCGA CGAACTGCTG GCCGCCATGG TCCGGTACCC GACCTGCGAG AGCCTGGCCG AGCACCTGAT GCTGGCCCTG TACCGGTGCG GTCGGCAGGC CGAGGCACTG GCGGTCTTCG ATCGCACCGC GCGGCTGCTG CGGGACCGGC TCGGCGTCGA CCCTTCCGAG TCGCTGCGAA GGCTGCACCG CCGGATCCTC AACCAGGATG CCGGACTCGC CGCCCCCGAA CCGATGGCGT TCAAGACAAC CACGTCAGAA ACAACCACAG TAGACGATGC CCCGGCAGAC ACCGTTGTTC CGGCGCAATT GCCCGCCCCA CCACGCGCCT TCGTCGGACG CGAATCCGAA CTGGCGGCCC TCAAACGCGA CGCGGACCGC ACCTCGGTCC TGATCCTGGA CGGCATGCCC GGCGTCGGCA AGACCGCGAC CGCGGTACGT CTCGCCACCG AACTGGCCCA GCGATACCCC GACGGACAGC TGTTCCTCGA CCTGCACGGC TACTCCGGCG ACGTCCCGGC GGTCGAACCC GCCGAGGCAC TGGTGCGGCT GTTGCGCGGC CTCGGCGCCG AGGCGGACCA GATCCCGACC GGCCTTGACG AACGCTCAGC GGAGCTGCGC ACCCGGCTGG CGGGCCGTCG GGTGCTGATC CTGCTCGACA ACGCGGCCAC CAGCGCCCAG GTCCGGCCGC TGCTGCCGGG CGGGACCGAC TGCCTCACGA TCATCACCAG CCGCCGCCGG TTGCCCGACC TTCTGGAGGC GGCACCGGCG TCGCTCGACG TCCTGGAACC GGAGGAGGCG GTGCGGCTGC TGGTCGCCGC GGTGGACGAC CCGAAGCGGG TCGCCGAGGA CTCCGCCGAC ACCGCCGCGA TCGTCGAGGT CGCCGGTCGG CTGCCACTGG CGATCCGGCT GATCGCGGCC CGGCTGCGCA ACCGCCGCAA CTGGACCGCC GGGTTCATGC TGGGACGGTT GCGGGACGAG ACGATCCTGA GTGAACTGTC CGCACAGGAC GTCGCTGTGG CTTCGGCGTT CTCGATGTCC TATGCCGAAC TGGACGATGG TCATCGCCGG ATGTTCCGGC TGCTGGGCCT GTTTCCCGGG CAGGACTTCG ATGCCACCTT CGCGGCGGCC TTGGCCGACG TCGCGCCGGA GGCGGCCGAT CGGATGCTGG AGGACCTGGT CGACGCGCAC CTGTTGCGCA GCGCCGAGCC GGGACGGTAC CGGTTCCACG ACCTGATGCG CCACTACGCG GCGACCATCG CGACCGAGAC CGAGACGGCC GCCGAAGTCG AGGCCGCTCG GACGCGCTTG TACGACACCG CCGCGGTCAT GCTGCGTCAC GCTATTTCGC AGTACGACTC CTATGTGGGG TATTACCCAA GATTGATCGA AGCCGTCGAC CCGCCCGAGT CCCCGTGGCG AACGCGGGCC GAGGCGTCCG CGTGGTTCGG CACCGAACTG CCCAACCTGA AGGCGATGCT GCGATCCGCC AACGAGCACC GGATGGACCG TCACTGCGCG GAGATGGCGG CGGCGATGTC GGCCTACTAC AGCCACCATC ATGCTGACCA CGACCTGGAC CGTATCGGGG AATGGGGTCT GGGCAGTGCC CGCCGGATCG GCGATCGGGA GTGTGAGGGC TACTTTCTCA ACAAGCTCGC AGGGGCTTAT CAGGCTTGGG GGGACGTCGG CATGGCCGAA CGGCTCCACG AACAGGCGTT GGCCGTTCGG CGCGAGCTCG GCGACGCCCG CTATATCGTC TCCAGCCTGT CCAACCTCGC TCTCGTGCAT GGGCACTCCG GCGAATATGA CCGGGCTGTC GAGTTGCGCG GCGAGGCGCT CGCGCTGGCC GCCGACAATG GACTCGTCGA ACTCGAACGG CTCATCTGCG TCTACATGGC CGGTTCACTG TGCGAGAGCG GCCGAATCGC CGAAGCCCGG CTTCAGTTGG AGCGGGCCGG GGAACTGCTG ACCGGCTCCG ATGACGCGTT CGCTCGGATG AGTCTCGACG CGCACTGGGG AAACGTGAAA CGCGGTGAGG GCGATCCCGT CGAGGCCCAA CGGCTTCATG AACGGGCGTT GGCCGCGTAC GAGACCCACG GTCACTCGGT GGGACAGGTT CAGATGCACT CCGAGATCGC CAGCGACCTG ATCGCGCGGG AACTGTGGGA CGAGGCGTGG AAGTCCTGCG TCACGTCCAT GGAATTGCTG GGCGACACCG AACGTCCCGA GCTGCGGGCG GAGAACCTGC TGACGATGGC AGAGGTGTGC CTGGCACGCG GTCACGACGA GGACGCGTTC GAACAGCTCA GCGCGGTGGC GGACCTCGCC GAGAGCCGCG ATTCGGCCGT GCTGCGCGCC AAGGCCGACT GGGGCCTGGC CCGGGTCGCG TCCGCCAAGG GGGACAACGA AAACGCCGTC CAGCACGCCG AACGAGCCCT CGCCTACTAC TCCCGCTTCG ACACACCCCG CACCGAAGCC ATCCGCCGCT TCCTGTCCCA GCCCTGA
|
Protein sequence | MLVRLLGGVE VAGAEGQWRT VPGKRAGVLA VLAVAAGEPV PADELVHRVW GASAVALADG AVYPHITRLR ATLSPAGLTI SRARGGYVLD LDADQVDLLT IRALAVRARE AAETGEDAKA LTAWQRATAL LRGEALAGVD GDWAERFRQS FGGEARSLLA ERYGWELRWG RHHAVVDELL AAMVRYPTCE SLAEHLMLAL YRCGRQAEAL AVFDRTARLL RDRLGVDPSE SLRRLHRRIL NQDAGLAAPE PMAFKTTTSE TTTVDDAPAD TVVPAQLPAP PRAFVGRESE LAALKRDADR TSVLILDGMP GVGKTATAVR LATELAQRYP DGQLFLDLHG YSGDVPAVEP AEALVRLLRG LGAEADQIPT GLDERSAELR TRLAGRRVLI LLDNAATSAQ VRPLLPGGTD CLTIITSRRR LPDLLEAAPA SLDVLEPEEA VRLLVAAVDD PKRVAEDSAD TAAIVEVAGR LPLAIRLIAA RLRNRRNWTA GFMLGRLRDE TILSELSAQD VAVASAFSMS YAELDDGHRR MFRLLGLFPG QDFDATFAAA LADVAPEAAD RMLEDLVDAH LLRSAEPGRY RFHDLMRHYA ATIATETETA AEVEAARTRL YDTAAVMLRH AISQYDSYVG YYPRLIEAVD PPESPWRTRA EASAWFGTEL PNLKAMLRSA NEHRMDRHCA EMAAAMSAYY SHHHADHDLD RIGEWGLGSA RRIGDRECEG YFLNKLAGAY QAWGDVGMAE RLHEQALAVR RELGDARYIV SSLSNLALVH GHSGEYDRAV ELRGEALALA ADNGLVELER LICVYMAGSL CESGRIAEAR LQLERAGELL TGSDDAFARM SLDAHWGNVK RGEGDPVEAQ RLHERALAAY ETHGHSVGQV QMHSEIASDL IARELWDEAW KSCVTSMELL GDTERPELRA ENLLTMAEVC LARGHDEDAF EQLSAVADLA ESRDSAVLRA KADWGLARVA SAKGDNENAV QHAERALAYY SRFDTPRTEA IRRFLSQP
|
| |