Gene Moth_1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1973 
Symbol 
ID3831155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2057287 
End bp2058294 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content55% 
IMG OID637829904 
ProductNLPA lipoprotein 
Protein accessionYP_430814 
Protein GI83590805 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000524772 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGA TGCCAAGAAC TGTTACCCTG ATTTTGGTCC TAACGCTGAC CGCCGGCCTT 
CTGGCCGGCT GCGGTTCCCA AAAAACCGCA ACTTCTGGCG AAAAACCCGT AATCAAAATA
GGCTACCTGC CCATTACCCA TTCCCTTCCA CTGGTAGTGG CCGATGCCCG GCATAAATCC
GACTTTAACA ACTTCCGGCT GGAGCTGGTT AAATTCGGCT CCTGGCCGGA TCTTACTGAG
GCCTTAAATT CCGGCCAGAT CCAGGGAGCT ATCACCATGC TGGAGCTGGC TCTGGCAAGT
AAGGCCAAAG GTATTCCCGT TGAAGTAGTG TTGTTAAGCC ACAAAAACGG TGATGTTCTG
GTGGCCGCCC CTCCGATTAA AGATGTAAAA GATTTAAAAG GAAAAAGGGT GGCCATCCCC
CACCGCCTGT CCGGTCATAA TATTCTTTTA TACAAGGCCC TGCAGCAGGC GGGCCTGGCT
TACAGCGACG TCCAGGAGGT AGAAATGGCC CCGCCGGAAA TGGCCGCCGC CCTGGCCAGG
GGCGAAGTGG CGGCCTACGT GGTAGCCGAA CCCTTCGGCG CCCAGGCAGT GGTAGCTGGC
ACGGGACGGG TACTAAAACG GGCCCAAGAT ATAATCCCAG GCTGGGAGTG CTGTGGCCTG
GTTATAAACC AGCAGTTGGT CAGGGAAAAT CCGGCCGCCG TCCAGGAACT GGTCGGCAGC
CTGGTAGACA CCGGTCATTA TATAATGAGC GATCGCAGGA CGGCCATCGA AATGGCCCGG
CCGTATATAC CAGTGGCCAG GGAAACCTTG GAGCAGTCCC TGCAGTGGAT CGATTATAGC
GATCTCATGC CTACAACAGA GGGACTGGCC AGGATCGAAC AGTACCTCAA AGAAATACCC
TGGGATGGCC AGCCGGGACG CCTGTTACCT GGTGGAGAAA TTAAACTGGA AGAACTGGTC
GACGACCGCT TCGCCCGGCA GGCCATATTA CCTACGCCGA AAAATTGA
 
Protein sequence
MKVMPRTVTL ILVLTLTAGL LAGCGSQKTA TSGEKPVIKI GYLPITHSLP LVVADARHKS 
DFNNFRLELV KFGSWPDLTE ALNSGQIQGA ITMLELALAS KAKGIPVEVV LLSHKNGDVL
VAAPPIKDVK DLKGKRVAIP HRLSGHNILL YKALQQAGLA YSDVQEVEMA PPEMAAALAR
GEVAAYVVAE PFGAQAVVAG TGRVLKRAQD IIPGWECCGL VINQQLVREN PAAVQELVGS
LVDTGHYIMS DRRTAIEMAR PYIPVARETL EQSLQWIDYS DLMPTTEGLA RIEQYLKEIP
WDGQPGRLLP GGEIKLEELV DDRFARQAIL PTPKN