Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2469 |
Symbol | |
ID | 7979025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2496510 |
End bp | 2497775 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799271 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_002950431 |
Protein GI | 239827807 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA TTGGATTAGC TTGGCAAATT TTTATTGGTC TCATTCTAGG GATTATCGTG GGAGCCATTT TTTACGGAAA TCCGAAGGTT GCCACTTATT TACAGCCTAT TGGAGATATT TTCCTTCGTT TAATCAAAAT GATTGTCATT CCGATTGTTA TTTCTAGCCT TGTAGTTGGA GTCGCCAGCG TTGGGGATTT GAAGAAGCTT GGAAAATTAG GCGGCAAAAC GATTATTTAT TTCGAGATTA TCACAACGAT CGCGATTGTC GTCGGTTTAT TGGCAGCGAA TATTTTTCAG CCAGGGACCG GCGTTAATAT GAAATCATTA GAAAAAACCG ATATTCAAAG CTATGTTGAT ACAACAAACG AAGTGCAGCA TCATTCGATG GTAGAAACTT TTGTTAATAT TGTTCCAAAA AATATTTTTG AATCGTTAAC CAAAGGGGAT ATGCTGCCGA TCATTTTCTT CTCTGTTATG TTCGGTTTAG GAGTAGCGGC GATTGGCGAA AAAGGAAAGC CAGTTCTTCA ATTTTTCCAA GGTACAGCAG AAGCGATGTT TTATGTAACA AACCAAATTA TGAAGTTTGC GCCGTTCGGC GTGTTTGCGC TGATTGGTGT AACCGTTTCT AAGTTTGGGG TAGAGTCGCT TATTCCGCTC AGCAAGCTCG TCATTGTTGT TTACGCAACG ATGGTGTTCT TTATCTTTGT CGTGCTTGGC GGTGTTGCTA AGTTATTTGG TATAAATATT TTTCATATTA TAAAAATTTT GAAAGATGAG TTAATTCTTG CTTATAGTAC AGCAAGTTCG GAAACCGTTC TTCCGAAAAT TATGGAGAAA ATGGAGAATT TCGGTTGTCC AAAAGCGATT ACATCCTTTG TCATTCCGAC AGGGTATTCT TTTAACTTAG ACGGTTCTAC GTTATATCAG GCGTTGGCGG CCATTTTTAT CGCGCAGTTG TACGGTATTG ACATGCCGAT TTCTCAACAA ATCTCGCTTT TGCTTGTGTT AATGGTGACT TCGAAAGGAA TCGCTGGGGT GCCGGGTGTA TCCTTTGTCG TGCTGCTTGC TACGTTAGGC ACGGTTGGGA TTCCGATAGA AGGATTAGCA TTTATCGCTG GAATCGACCG TATTTTAGAT ATGGCGCGCA CAGCAGTGAA TGTTATTGGC AACTCGTTAG CAGCGATCAT TATGTCAAAA TGGGAAGGCC AATATAACGA AGAAAAAGGA AAACAATACA TCGCGCAATT GCAGCAAAGT GCATAA
|
Protein sequence | MRKIGLAWQI FIGLILGIIV GAIFYGNPKV ATYLQPIGDI FLRLIKMIVI PIVISSLVVG VASVGDLKKL GKLGGKTIIY FEIITTIAIV VGLLAANIFQ PGTGVNMKSL EKTDIQSYVD TTNEVQHHSM VETFVNIVPK NIFESLTKGD MLPIIFFSVM FGLGVAAIGE KGKPVLQFFQ GTAEAMFYVT NQIMKFAPFG VFALIGVTVS KFGVESLIPL SKLVIVVYAT MVFFIFVVLG GVAKLFGINI FHIIKILKDE LILAYSTASS ETVLPKIMEK MENFGCPKAI TSFVIPTGYS FNLDGSTLYQ ALAAIFIAQL YGIDMPISQQ ISLLLVLMVT SKGIAGVPGV SFVVLLATLG TVGIPIEGLA FIAGIDRILD MARTAVNVIG NSLAAIIMSK WEGQYNEEKG KQYIAQLQQS A
|
| |