Gene MCA1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1941 
Symbol 
ID3102722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2089178 
End bp2090482 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content64% 
IMG OID637171096 
Productsugar ABC transporter, sugar-binding protein, putative 
Protein accessionYP_114374 
Protein GI53804010 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTTT CGTTTTCACC TCGGAATTTC AGACTCACCG GTATGGCCAT GAACTTTCGT 
TTGTCGCTTG TCATCCTGTC GTCCTGCCTC TGGCTTCTGA ACGCTTGCCA GCCGGGCCAT
GACGAAAGTT TGACGTTCTG GGCATTCGGC AGCGAGGGAG AGGTGGCCCA TGAGCTCGCC
AGCGAGTTCG AGAGGCTGAA TCCGGATGTC CGCGTCGAGG TCCAGCCAAT CCCCTGGAAC
GCCGCCCACG AAAAGCTGCT GACGGCTTAC GCCGGCGGCA GCCTGCCCGA CGTGTTCCAG
CTCGGCAATA CCTGGATAGC GGAATTCCAG GCATTGCGGG CACTGGACGA CGTCACCCGC
CGTTTCGGGA ACACCGGACA GAAGGGCGAC TACTTCGAAG GAGCGCTGGA CGCGACCTTC
GTCGAGGGCC GGATGTGGGC GGTCCCCTGG TATGTCGACA CGCGCATTCT GTTTTACCGG
CGCGACTTGC TGGAAGCGGT GGGGATCACA TCGCCACCCC GGCGCTGGGA CGGCTGGTTC
GAGGCGCTGT CCCGCTTCCA TGAGCGCACC GGGGCTTTCG GGCTGTTCCT TGCCATCGAC
GCCTGGGAGG TTCCCGTCGT TTTCGCGATG CAGCACGGAG CGGCCCTGCT GAAGGATGGG
GACCGCCATG GCGATTTCCT CCATCCCGGT TTCCGGGCAG CGTTCGATGC CTATCTGGGC
TGGTTCCGGG AAAAACTGGC GCCGGCGGAA AGCTCCGGCC AAGTGGCCAA TCTCTATCGG
GAATTCGAAC AGGGCTATTT CGCTGCGCTC ATCACCGGCC CGTGGAATCT CGGCGAGTTC
CGGCGGCGGC TGCGCGACCT CGACGCTGCC GCCTGGGACA CCGCGCCCTT GCCGTCTTTC
GACGATTCCT ATCCCGGCAT CTCCCTGGCC GGGGGAGGCA GTCTGGCGCT GGCCTACGCG
TCCAGACACA AGGATGCGGC CTGGCGGCTG ATCGAGTTCC TCACCGAGCC TCGGCAGCAG
GTTTACCTGT ACCGTCTCAC CGGCGATCTG CCACCGGGAC GGACCGCTTG GAGCGATCCC
GCTCTGGCAG GCGACCGCAA GGCACGGTCT TTCCGCAGCC AGATGGAACG TATCGCGCCC
TTGCCGAAGA TTCCCGAGTG GGAGCGGATT GCCAGCCGGA TCGCGGCCTG CGCCGAGCGG
GCGGTACGGG GCGAAGTACC CCCGCAGGCG GCACTGGAGG CATTGAACAG GGAAGTGGAC
CGGATTCTGG AAAAACGCCG CTGGCTGCTG GAGCGGGGGT TGTGA
 
Protein sequence
MSLSFSPRNF RLTGMAMNFR LSLVILSSCL WLLNACQPGH DESLTFWAFG SEGEVAHELA 
SEFERLNPDV RVEVQPIPWN AAHEKLLTAY AGGSLPDVFQ LGNTWIAEFQ ALRALDDVTR
RFGNTGQKGD YFEGALDATF VEGRMWAVPW YVDTRILFYR RDLLEAVGIT SPPRRWDGWF
EALSRFHERT GAFGLFLAID AWEVPVVFAM QHGAALLKDG DRHGDFLHPG FRAAFDAYLG
WFREKLAPAE SSGQVANLYR EFEQGYFAAL ITGPWNLGEF RRRLRDLDAA AWDTAPLPSF
DDSYPGISLA GGGSLALAYA SRHKDAAWRL IEFLTEPRQQ VYLYRLTGDL PPGRTAWSDP
ALAGDRKARS FRSQMERIAP LPKIPEWERI ASRIAACAER AVRGEVPPQA ALEALNREVD
RILEKRRWLL ERGL