Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0517 |
Symbol | |
ID | 7978233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 587229 |
End bp | 588494 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644797518 |
Product | C4-dicarboxylate transporter DctA |
Protein accession | YP_002948692 |
Protein GI | 239826068 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.946656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGGTA AGTTTAAAAA CTTAACCGTA CAAGTAATTA TTGGAATTAT ACTGGGGATT ATCGTTGGTT TTTTATTCCC TGAATTTGGA TCAAAATTAA AAGTGCTTGC AGATGCATTT ATTAAGCTAA TTAAAATGGT CATTGCGCCG ATTATTTTCT TCACGGTCGC AATTGGAATC GGCAGCATGG GGGATTTGAA AAAGGTTGGC CGCATCGGTG GAAAAGCGCT TATTTACTTT GAAATTATTA CTACGTTCGC TTTAGCAATC GGAATTATTG TGGTTAATTT GATCAAACCT GGGGTAGGGT TTAATACTGA TGCCGTAAAA GGCGGCGATG TATCACAGTA TACGAAGCAA GCAGAAGAAG TGAATCATGG CGTCATTGAG TTTTTGCTTG GCATCATCCC GGATAACGTT GTTGGGGCGC TAGCGAAAGG CGAATTATTG CCGATTTTAT TCTTTGCCGT ATTATTCGGC CTTGCCGCGG CGGGGTTAGG AGAAAAAGCC AAACCGGTTA TTACCTTATT TGAGCGTTTG GCGGACATTT TCTTCGGTGT TGTTAATATG ATCATGAAAG TATCGCCAAT TGCAGCGTTC GGTGCGATGG CGTATACGAT TGGCACATTT GGAATCGGTT CATTGCTTTC GCTTGGAAAA TTAATGGCTT CTGTGTATAT TACGATGGCG CTATTTATTA TTGTCGTACT CGGACTTATC GCGAAGTTTT ACGGGTTTAA TATTTTTAAA TTTATTGCTT ATATCAAAGA GGAAATTTTA CTTGTGCTTG GCACATCTTC TTCTGAATCG GCATTGCCAA AACTGATGGA GCGCCTTGAA AAATACGGGT GTTCGAAGCC AGTTGTTGGG CTTGTTGTGC CGACAGGATA TTCCTTTAAC CTCGATGGAA CATCTATTTA TCTTTCGATG GCGGCGATTT TTATCGCCCA AGCTTACGGT ATCGATTTAA GCATTTGGCA AGAGCTTACG TTGCTCGGAA TTTTAATGTT AACGTCAAAA GGTGCAGCAG GGGTCACAGG TTCCGGATTT ATTACACTTG CGGCGACTTT GGCAGCGTTC CCGATGATTC CAGTAGAAGG AATCGCGTTA TTGCTTGGCG TAGACCGCTT TATGTCGGAA GCGCGTGCCA TTACAAACTT AATTGGCAAT GCTGTAGCGA CTGTTGTTGT TTCTAAGATG GAAAATGAAT TTCATCCTTC TGAAGAACAA CATGCAGAGA GAACGAAGAT GGTTGTTGCA AAGTAA
|
Protein sequence | MRGKFKNLTV QVIIGIILGI IVGFLFPEFG SKLKVLADAF IKLIKMVIAP IIFFTVAIGI GSMGDLKKVG RIGGKALIYF EIITTFALAI GIIVVNLIKP GVGFNTDAVK GGDVSQYTKQ AEEVNHGVIE FLLGIIPDNV VGALAKGELL PILFFAVLFG LAAAGLGEKA KPVITLFERL ADIFFGVVNM IMKVSPIAAF GAMAYTIGTF GIGSLLSLGK LMASVYITMA LFIIVVLGLI AKFYGFNIFK FIAYIKEEIL LVLGTSSSES ALPKLMERLE KYGCSKPVVG LVVPTGYSFN LDGTSIYLSM AAIFIAQAYG IDLSIWQELT LLGILMLTSK GAAGVTGSGF ITLAATLAAF PMIPVEGIAL LLGVDRFMSE ARAITNLIGN AVATVVVSKM ENEFHPSEEQ HAERTKMVVA K
|
| |