Gene Moth_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2065 
Symbol 
ID3831096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2157647 
End bp2158873 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID637829993 
ProductSodium/hydrogen exchanger 
Protein accessionYP_430903 
Protein GI83590894 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0475] Kef-type K+ transport systems, membrane components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0630296 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAC AGGTGCTCCT TGAAATCGGG CTAGCATTAG CTATCGTAGC TTTTGCCGGG 
ATCCTTGCCG CCAGGTTCCG CGTTTCTATT GTACCGCTCT TGATCCTTGC CGGTATGGTT
GTCGGCCCTC ATGCGCCGGT AATTGGTATC CTGGATTTTC GCTTTATTAA AAGCGCGCCT
TTAATAGATT TTATGGGGCG GGTAGGGATA CTCTTTCTCC TTTTCAGCCT CGGCCTGGAG
TTTTCCGTCG GGAGATTATT AAAGGCAGGC CGTTCTATAC TGGTGGGCGG GTCCATCTAT
ATGGCCATAA ATTTTACCCT GGGCATGGTT CTACCTATAA TTTGGGGTTG GCCGTTGCGG
GAAACCCTGG TGGTAGCCGG GCTTATTTCT ATTTCCTCAA GCGCCATTGT TGCCAAGGTT
CTGGTTGACC TAAAGCGAAC GGCACGGCCG GAAACCGAGA TGATCCTGGG GCTTATGTTA
TTCCAGGACG TATTCGTAGC AGTGTATCTA TCCATCATTT CCGGTCTGGT CCTTACAGGT
TCGGCCTCAC CGGCGAGCGT GTTGAAATCT ACCTCCCTTG CCCTGGGATT TATGCTGGGC
TTAATTCTCG CCGGCCGCAA ACTGGCACCG TTAATTAACA GGCTGCTTAA CGTTCCTTCC
GATGAAGTTT TTATGCTCAT AGTCTTTGCT TTCCTCACCC TGGTAGCCGG TTTTTCAGAG
ACTATCCATG TGGCGGAAGC TATTGGCGCC TTGCTGGTGG GTTTAATTTT AGCCGAGACA
GACCATCTCG ACCGCATCGA GCATATTGTC GTGCCGTTCC GTGATTTTTT CGGGGCCCTG
TTTTTTTTCA GCTTCGGTTT GAGCATCGAC CCTTTAACCT TGGGAGGGGC CGTCGGGCCG
GTTTTGACTG CCGTAGCGGC AACATTAACA GGCAATTTTC TTGCTGGTAT TCTTGCCGGA
CGAATGGCCG GTTATTCGTA CCGGGGGTGT ACCAATATCG GGCTTACTAT TACTCCCCGC
GGAGAATTTT CTATCATCCT CGCCAATCTG GCGAAGACCG GCGGATTACT GCCGGTGCTA
CAACCCTTCG CAGCCCTGTA TGTGTTGCTT ATGGCTATTC TGGGTCCTTT ACTTACGAAA
GAATCTAAAT GGATATACAA CCAACTGGCT CACATCTTTG GTTGGCCTGC CTGGAAAGAG
ATAAATAAAC CCGATAGAAC GATATAA
 
Protein sequence
MPEQVLLEIG LALAIVAFAG ILAARFRVSI VPLLILAGMV VGPHAPVIGI LDFRFIKSAP 
LIDFMGRVGI LFLLFSLGLE FSVGRLLKAG RSILVGGSIY MAINFTLGMV LPIIWGWPLR
ETLVVAGLIS ISSSAIVAKV LVDLKRTARP ETEMILGLML FQDVFVAVYL SIISGLVLTG
SASPASVLKS TSLALGFMLG LILAGRKLAP LINRLLNVPS DEVFMLIVFA FLTLVAGFSE
TIHVAEAIGA LLVGLILAET DHLDRIEHIV VPFRDFFGAL FFFSFGLSID PLTLGGAVGP
VLTAVAATLT GNFLAGILAG RMAGYSYRGC TNIGLTITPR GEFSIILANL AKTGGLLPVL
QPFAALYVLL MAILGPLLTK ESKWIYNQLA HIFGWPAWKE INKPDRTI