Gene Moth_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1863 
Symbol 
ID3831494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1924905 
End bp1925963 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID637829795 
Productcobalt transport protein CbiM 
Protein accessionYP_430706 
Protein GI83590697 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0310] ABC-type Co2+ transport system, permease component 
TIGRFAM ID[TIGR00123] cobalamin biosynthesis protein CbiM 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.292788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATATTC CCGACGGTTA CTTAAGTCCC CAGACCTGTG CGGTACTGGG TGCGGCCATG 
GTGCCTGTGT GGGGCACGGC CGCCCGCAAG GTCAAAGCCA CCCTGAAGGC CAGGCAGGCT
CCCCTCCTGG CCATCGGCGC CGCTTTTTCC TTCACTATCA TGATGTATAA TATCCCCATC
CCCGACGGGA CGACGGCCCA CGCCACCGGC GGCGCCCTAT TAGCCATCCT CCTGGGCCCG
TGGGCGGCGG CTATCGGCAT CTCTATCGCC CTGGCCATCC AGGCCCTTTT CTTCGGCGAC
GGCGGCATCC TGGCCTTCGG CGCCAATGCC TTTAATATGG CCTTTATCCT TCCCTTCGCC
AGTTACTACA TCTATCGCCT CTTATCCGGC CGGACAAGCC TGCACTCCGG CTGGCGGGCC
GTGGCGGCCG CCATCGCCGG GTTTGTCGGC CTTAACCTGG CCGCCCTGGC CGCAGCGGTG
GAATTCGGCC TGCAACCCCT CCTCTTCCAT ACGGCCAGCG GTGTTCCCCT CTACAGCCCC
TACCCCCTGG CCTTAGCCGT CCCGGCCATG GCCCTGGCCC ACGTCCTGAT AGCCGGGCCG
GCCGAAGGAA TAGTCACCGG CCTGGTTATT CGCTACCTGC AGCGGGTTAA TTCCGGTCTG
CTGCGGGTTT ACCCGGCGAC AGGAGCTGTG GTGGCAGCTC AAGCGACGGG TGACGGTGCC
AGCCTTAAGA AACTGGCCTG GGGGTTGGTT ATCCTCGTCC TGTTATCCCC CCTGGGGTTG
CTGGCTGCCG GTACCGCCTG GGGCGAGTGG TCACCGGAAG ACCTGCAACA AATCCTCGGT
TTCGTTCCCC CGGGTCTGGC CCGCCTGGCT ACCACCTGGA CTCATGCCCT TTTCCCGGAT
TATACCGTCC CGGGCCTGGA GGGGAGCTTT TGGGCCCAGG CCCTGGGTTA TATCATCACC
GCCATGGTTG GGCTGGGCAT AATCTTCCTT ATCTTCCTGG CTTTTAACCG GCTCCTGGCC
CGGCCGGGGA AGACAGGAGC CGATTATCAT GGCAAATAA
 
Protein sequence
MHIPDGYLSP QTCAVLGAAM VPVWGTAARK VKATLKARQA PLLAIGAAFS FTIMMYNIPI 
PDGTTAHATG GALLAILLGP WAAAIGISIA LAIQALFFGD GGILAFGANA FNMAFILPFA
SYYIYRLLSG RTSLHSGWRA VAAAIAGFVG LNLAALAAAV EFGLQPLLFH TASGVPLYSP
YPLALAVPAM ALAHVLIAGP AEGIVTGLVI RYLQRVNSGL LRVYPATGAV VAAQATGDGA
SLKKLAWGLV ILVLLSPLGL LAAGTAWGEW SPEDLQQILG FVPPGLARLA TTWTHALFPD
YTVPGLEGSF WAQALGYIIT AMVGLGIIFL IFLAFNRLLA RPGKTGADYH GK