Gene Moth_0383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0383 
Symbol 
ID3832627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp387720 
End bp389132 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content46% 
IMG OID637828320 
Productanion transporter 
Protein accessionYP_429260 
Protein GI83589251 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGA AAAATACAGC TGCTGGCTCG TACGGCAAAG TGTTAGCAGG TCTGTTAGCT 
ATTATTGTCT ACATTATACT GACAAATTTA CCCACTCCTG CGAATTTGCC GCCCCAAGGC
CAAAAAGCCC TGGCCTTTAT GGTCGCTGTC GTGATCGTCT GGGTTTTTGA AGTCATTCCT
ATCGGTATTT CCGCAGCTCT GTTCTTAATG ATTATGGATA TACTAAAGGT CTTCCCTATG
AAGGATGCCA TGGCGAATTT TGCTACTACC ACTCTTTTCT TTATCTTATC AGCGTTTATT
ATAGCCATAA CTTTCATCAA TACTGGGCTT GGGAACCGCG TTTCGTTAAT GGTGAGCGCT
ATCTTTGGGC AGAAAACTGA TAGAGTCCTG CTGAGCTTTA TGTTACCTAC AGCTATTATT
TCTAGCGTAC TAGCGGACAT TCCCACAGCT GTGATTTTTT CGAGTATAGC ATATCCTCTT
CTACAGAAAA ATGGCTGCCT TCCGGGGAAG TCAAATTTTG GCAAGGCCTT GATGTTGGGG
ATTCCTATTG CCGCAGCTAT TGGCGGTATT GCTACCCCTG CGGGTAGTGG TCTCAATATC
ATGTCTATTT CACTCCTCAA GAACACGGCC GGCGTCGAGA TTAATTTTTT ACAATGGGCG
CTTATCGGAT TTCCTATGGC AATCTTACTC ACCCTGGCAG CCTGGTATAT TGTGCTAAAA
TTTTATCCGC CCGAATTTGA CCACGTACGG GGATTGGAAG ATATCGCGAA AGCCAGACAG
GATCTTGGCC CTCTCACGGT CAACGAGAAA AAATTCATAG CCATCTTCTC CGTCACGTTG
GTCTTGTGGT TTACTCAGCC ATGGAATCAT ATCGATCCCT CGGTAGTTGC TATTATCACG
GCTTCCTCAT TTTTCCTGCC GGGAGTCAAA TTAGCAACCT GGGATGATGT CAAAGGAAAA
TTGAGCTGGG ATGTTTTACT CCTACTAGGG ACTGCCAACA GTCTGGCCAT GGCGATTTGG
CAGCTCAAGG GAGCTGCTTG GCTGGCCAAC ACGGTCCTGG GTGGATTGGC TGGTGTCGGC
CTCCTGATAG TATTGTTCGC CGTTACAGCT TTCGGCATCT TCTCCCACTT AATTATACCT
GTAGGTGGTG CCGTAGTGGC TGTAGCCATT CCGGTACTCG CAGTACTGGC TAAAAATACC
GGGATCAATC CTGCCCTGCT AGTTATTCCA ATTGCGTATA CTGCGTCTTG TGTATTTTTA
TTACCTCTGG ATCCCATTCC GCTAACCACA TATCATTACA AATATTGGAA ATTTTGGGAC
ATGATGAAAC CAGGTTTCCT TATTTCCCTC GTCTGGTTGG TTTTAATGGT TATATTTATG
TATATAGGAC AGGGCGTTGG AATAATACGA TAA
 
Protein sequence
MDAKNTAAGS YGKVLAGLLA IIVYIILTNL PTPANLPPQG QKALAFMVAV VIVWVFEVIP 
IGISAALFLM IMDILKVFPM KDAMANFATT TLFFILSAFI IAITFINTGL GNRVSLMVSA
IFGQKTDRVL LSFMLPTAII SSVLADIPTA VIFSSIAYPL LQKNGCLPGK SNFGKALMLG
IPIAAAIGGI ATPAGSGLNI MSISLLKNTA GVEINFLQWA LIGFPMAILL TLAAWYIVLK
FYPPEFDHVR GLEDIAKARQ DLGPLTVNEK KFIAIFSVTL VLWFTQPWNH IDPSVVAIIT
ASSFFLPGVK LATWDDVKGK LSWDVLLLLG TANSLAMAIW QLKGAAWLAN TVLGGLAGVG
LLIVLFAVTA FGIFSHLIIP VGGAVVAVAI PVLAVLAKNT GINPALLVIP IAYTASCVFL
LPLDPIPLTT YHYKYWKFWD MMKPGFLISL VWLVLMVIFM YIGQGVGIIR