Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_0909 |
Symbol | |
ID | 5877206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 933182 |
End bp | 934822 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641541265 |
Product | carbon starvation protein CstA |
Protein accession | YP_001662546 |
Protein GI | 167039561 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1966] Carbon starvation protein, predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000299613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCAA TAATTTTACT GATAAGTGCT ATTATTCTTT TTGTACTTGC TTATATCTTT TACGGAGGCT GGCTTGCAAA ACAGTGGGGT TTGCAGCCAG ACAATAATAC CCCTGCACAC ACCATGTATG ATGGCGTGGA CTATGTCCCT GCAAAGGCAC CTGTCCTTCT GGGCCATCAC TTTGCATCCA TAGCCGGTGC CGGTCCTATA AACGGACCTA TACAAGCCGC TATATTTGGA TGGGTACCAG CAACACTCTG GATTCTTTTA GGCGGTATAT TCCTAGGTGG AGCCCATGAT TTTGGTTCAC TTCTTGCATC ATTGAGGCAT AAGGGCAAGT CCATAGGTGA TATCATACAA GCAAATGTTG GCTTGACAGC AAAGAGGTTG TTTCTTTTGT TTTCCTGGTC GACACTGGTA CTCATAGTTG CAGCCTTTAC AAACATAGTG GCTGATACTT TTGTCTCTAC ACCCCAGGCA GCCACTGCTT CACTGCTTTT TATACTATTT GCCGTGGTAT TTGGATTTGC TGTATGCAGA AGGAATGCAC CACTTGGCTT GAGTACCGTG TTGGGTGTGG CGGCTTTGGC GTTATCTATA TGGATAGGTT ATATAGCCCC GTTATCATTG CCTAAGACCA CATGGATAAT CATCCTTGCC GTGTATATAG TAGCTGCATC AGTTATGCCT GTATGGATAC TCCTGCAGCC AAGGGATTAC TTGAACTCTT TCCTTCTCTA TGGAATGATG ATTGGCGGTG TTGTTGGTAT ACTGCTTTAT AACCCGAGGA TTCAACTTCC AGCGTTTACA AGCTTTAAGG TTGGTACGCA GTATCTCTTC CCGATGCTGT TCATTACCAT AGCATGCGGT GCCATATCCG GTTTCCACTC ACTGGTGGCC TCCGGCACCA CAGCAAAACA GCTGGACAAA GAAGGGGATG CTAAGTTAAT AGGCTATGGC TCTATGCTTA TAGAGAGCAC CTTAGCCATA ATTTCTATAA TTACCGCAGC GTATCTTACT CAAGATAAGT TTGCAGAGCT TATTAAGTTT GGTCCAACCA ATGTGTTTGC TGATGGATTA GGTACATTCA TGGCCAGCTT TGGTATAAAT CAGGTTGTAG GAAAGACCTT TGCTGCCCTT GCTGTGTCTG CATTTGCCAT GACCACGCTG GATACAGCCA CAAGGCTCGG CAGGTTTGCC TTCCAGGAGT TTTTTGAAAA TATATCGGAG TCTGGAGAGA CAGTGGCATC TTCTAACCCT GTGGCTAAAT TCTTTAGTGA CAGGTATGTG GCATCTGTAA TCACTGTGGT TATTTCTATT GTACTTGCAT TTACAAGCTG GAAGGCTATA TGGCCTATAT TTGGTGCGGC TAACCAGCTC CTCGCAGCGG TAGCACTGCT GGCAGTGGCT GCATGGCTTG CCAATGCAGG TAGGAACAAT AAAATGCTCA TTATCCCTAT GATATTTATG TTTATAGTTA CTTTGACAGC CCTTGTACTT CTGATTCAAT CTAATATTGC TTCTGGTAAC TATATACTGG TGTTATTTGC CGTCCTGCTG TTTATCCTGG CTATACTATT AATACTTCAA ACATATGGAG TGCTTACAGG AAAGAACAGA AGAGAAGGGG TAGCAAAATA G
|
Protein sequence | MSAIILLISA IILFVLAYIF YGGWLAKQWG LQPDNNTPAH TMYDGVDYVP AKAPVLLGHH FASIAGAGPI NGPIQAAIFG WVPATLWILL GGIFLGGAHD FGSLLASLRH KGKSIGDIIQ ANVGLTAKRL FLLFSWSTLV LIVAAFTNIV ADTFVSTPQA ATASLLFILF AVVFGFAVCR RNAPLGLSTV LGVAALALSI WIGYIAPLSL PKTTWIIILA VYIVAASVMP VWILLQPRDY LNSFLLYGMM IGGVVGILLY NPRIQLPAFT SFKVGTQYLF PMLFITIACG AISGFHSLVA SGTTAKQLDK EGDAKLIGYG SMLIESTLAI ISIITAAYLT QDKFAELIKF GPTNVFADGL GTFMASFGIN QVVGKTFAAL AVSAFAMTTL DTATRLGRFA FQEFFENISE SGETVASSNP VAKFFSDRYV ASVITVVISI VLAFTSWKAI WPIFGAANQL LAAVALLAVA AWLANAGRNN KMLIIPMIFM FIVTLTALVL LIQSNIASGN YILVLFAVLL FILAILLILQ TYGVLTGKNR REGVAK
|
| |