Gene Mbur_2131 details


Gene Information

Locus tag: Mbur_2131
Symbol:
ID: 3998214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 2237566
End bp: 2238525
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 47%
IMG OID: 637959867
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_566754
Protein GI: 91774062
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
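
The coordinates and lengths listed above are mutually consistent, and a short sketch can check that. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2237566, 2238525)
print(length, protein_length_aa(length))  # prints: 960 319
```

Both results match the Gene Length (960 bp) and Protein Length (319 aa) fields.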


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.603164
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGTTG CAGCAGTTTT CCTGTCAGGA TGTACTTTTG CACCCGATGA TGGAACTTTG 
ACCGAAATAA ACATCGGCTA TCAGCCAAGC ACACACCAGA TTTCATATAT GACAGCCTTT
GAGAACGGCT GGTGGGCTGA GGACCTTGCA CCATTTGGTA TCATGAGCAT AAACGAATTT
GAGTTCCCAA CAGGTACTCC TGAGATGCAT TCAATGATCG CAGGAAACAT TGATGTTGCA
TACGTTGGCG CAGCACCTGT TATTTCCGCA CTCAGTACCG GACTTGATGC AAAGATCGTC
GCAGCAGTGA ACACACAGGG TTCTAATCTT GTGCTCAGAA ATGAGTTCAA ATATGATGGT
CCTGCAGACC TTGAAGGTCT AAAGATAGCA ACCTTCCCAC CGGGAACCAT ACAGGATACC
ATCTTCAAGG AATGGTTGGT AGATAATGGT CTTGAACCTG GTACAGATGT CGAAGTTGTC
GCAATGGGTC CTGGAGACGC AACTGCTGCT CTTGCAGCAG GTAAAGTAGA CGGTGTATTC
CTGCCACACC CAGCACCAAC GTTCATTGAA GTTGAAGGTT CCGGTCGTTC AGTTGTTGCA
TCCGGGGAAA TACTTGCAGA CCATGCATGT TGTGTGCTTG TGGTCAGTGG GGATCTTATC
AGGAACAACC CTGAACTGGT CGAACAGATC GTAAAGACCC ACATCAAGGC TATAGAGTAT
GATAATCTCA ACATCGATGA TGCAGCGAAC ACATTTGCTA ACAAGCAGGG TGTTGACAAT
GCAACTGTCC TTCAGTCCCT TGAAAACTGG GATGGTGTCT GGTCAGCTGA CCCACGTCCG
CTTGTGGAGT CCACAGTAGA ATACGCAAAC TTCCAGTATG AGCTTGGTTA TATCAGCAGC
CAGCTTACAG AAGAGGATAT CTTTGACGTG AGCTTCTACG AGAAGGTCTC TGAAGAGTGA
 
Protein sequence
MLVAAVFLSG CTFAPDDGTL TEINIGYQPS THQISYMTAF ENGWWAEDLA PFGIMSINEF 
EFPTGTPEMH SMIAGNIDVA YVGAAPVISA LSTGLDAKIV AAVNTQGSNL VLRNEFKYDG
PADLEGLKIA TFPPGTIQDT IFKEWLVDNG LEPGTDVEVV AMGPGDATAA LAAGKVDGVF
LPHPAPTFIE VEGSGRSVVA SGEILADHAC CVLVVSGDLI RNNPELVEQI VKTHIKAIEY
DNLNIDDAAN TFANKQGVDN ATVLQSLENW DGVWSADPRP LVESTVEYAN FQYELGYISS
QLTEEDIFDV SFYEKVSEE
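
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is one minimal way to do that, not the database's own method. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares, and it strips the spaces and line breaks used in the listing above. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the listing into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGCTGGTTG CAGCAGTTTT CCTGTCAGGA"))  # prints: MLVAAVFLSG
```

Passing the full gene sequence to `translate` should give the listed protein, and `gc_fraction` on the same input can be compared with the GC content field.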