Gene MCA1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1944 
Symbol 
ID3102159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2091907 
End bp2092977 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content68% 
IMG OID637171099 
Productsugar ABC transportor, ATP-binding protein 
Protein accessionYP_114377 
Protein GI53804000 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGTA TCGCATTCGA ACGCCTGAGC AAAACCTACC CGGGCGGTTT TGCCGCGCTG 
TCCGACCTGA GTCTCGACAT CGCCGACGGC GAACTCCTGG TCGTCGTCGG ACCGTCGGGG
TGCGGCAAAT CCACGCTGTT GCGACTGATC GCCGGGCTGG ACCGCCCCAC CGCCGGCTCG
ATCCGGATCG GCGGCACGGA CGTGAATGCG CTCGCGCCTG CGGAGCGCAA CGTCGCCATG
GTTTTCCAGG ATTACGCGCT CTATCCCAAC ATGACGGTGA GGGGCAACCT CGAATTTCCG
CTGAAGATGC GCCGCATCGG TCGAGCCGAA CGCCGGCGCC GGATCGAGCA GGTGGCCGGG
ATGCTCGAAC TCATGCCGCT GCTCGACCGC CGCCCCGCCC AGCTCTCGGG CGGCCAGCGT
CAGCGCGTGG CCATGGGACG GGCGCTGGTC CGCGATCCCT CGGTGTTCCT GCTCGACGAA
CCCCTCGCCA ACCTGGACGC CCGTCTGCGC GCGCAGGTTC GTGCCGACAT TGCCGAACTG
CAGCGGCGGA CGGGGACGAC CATGATCTAT GTCACCCACG ACCAGGTCGA GGCCATGACC
CTGGGCCAGC GCATCGCGGT CCTCGCCGGC GGGCGGTTGC AACAGGCCGC CTCGCCCCGG
GAACTCTATG CCCATCCGGC CAACACTTTC GTCGCCGGTT TCATAGGCAA TCCACCCATG
AATCTCCTGC CGGTTCGGAT CGGGCGCTCC GGCGAGACCG TCCTCCCCGA TTTCGGCGGC
GTGCCACTGC CGGGCAGGGC GGATGCGCCG AAAGATGCAG CTATCGCCGG CATCCGGCCG
GAAGCTGTCC GCCTGGCGGA AACGTCGTCC GAGGGGATCG CCGTCCGCGT CCGGGAGGTG
GAATATCTCG GCCATGAAAC CTTGCTGCAT TTCACCCATG AAGCGGGAGC GGCCAGCCTC
ATCGCCCGCC TTCCCGGTCT GCCGCCCTTT GGCCGCGGCG ATGCCGTCCG TCTCGGGATG
GCGCCGGAAG ACTGGCATTT CTTCGATCGG AGCGGTCAGG CGTTGGGCTG A
 
Protein sequence
MGSIAFERLS KTYPGGFAAL SDLSLDIADG ELLVVVGPSG CGKSTLLRLI AGLDRPTAGS 
IRIGGTDVNA LAPAERNVAM VFQDYALYPN MTVRGNLEFP LKMRRIGRAE RRRRIEQVAG
MLELMPLLDR RPAQLSGGQR QRVAMGRALV RDPSVFLLDE PLANLDARLR AQVRADIAEL
QRRTGTTMIY VTHDQVEAMT LGQRIAVLAG GRLQQAASPR ELYAHPANTF VAGFIGNPPM
NLLPVRIGRS GETVLPDFGG VPLPGRADAP KDAAIAGIRP EAVRLAETSS EGIAVRVREV
EYLGHETLLH FTHEAGAASL IARLPGLPPF GRGDAVRLGM APEDWHFFDR SGQALG