Gene MCA1521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1521 
Symbol 
ID3103891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1623477 
End bp1624823 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content64% 
IMG OID637170695 
Productmajor facilitator family transporter 
Protein accessionYP_113977 
Protein GI53804195 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.537503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCT TGTCTCCGGC CACACAATTC CGGAGGATAG CCCGCTCCAA ACCCCTCCCC 
GACAAGGATT CCACCATCCC CATGACATCT GCACCCCCGT TGCGTCGCCG TCACGACCGC
ATCATCCTCG CCGGACTGAT CGGCAACGTG ATGGAGTGGT ACGACTTTGC CGTTTACGGC
TATTTCGCCG TTGTGATCGG CAAGCTGTTT TTTCCGGCGG ATGATCCCGC GGCTTCGCTG
ATCGCCTCGT TCGGAGCGTT CGCCGCTGGC TTCATCGTCA GGCCGGTCGG CGGATTGCTA
TTCGGCCGGA TCGGAGACCG CCTGGGGCGG CAGCAGGCAC TCACCTGGTC GGTCATGGCA
ATGGCCGTGC CCACGGTGCT CATGGCGTTC CTGCCGACCC ATGCTTCCGC CGGTATCGCC
GCCCCGGTTG CCATCGTCCT GCTCCGCATC GTTCAGGGCT TGTCCGTCGG CGGGGAATTT
ACCAACTCCC TCGTGTTTCT GGTGGAGAAT GCGCCGGGCG AACGTCGGGC CTTCACCGCA
GTGTGGGGAA GTTGGGGCGC ATCTGCGGGC ATACTGCTGG GATCGGGTGC AGGTGATCTG
CTGACCCATG TCCTGAGTGA AGAACAAGTC CTGAACTGGG GCTGGCGTTT GCCGTTCCTG
GCCGGGGGGC TGGTGGCGCT AACGGGTTAT TGGCTCCGCC AGGGGCTGGA GCCGGAACTT
CCGAATGCGG AACACGCCAG CCCGGTCCGG GCCGTGTTCG CCAGGCACAA AGGGGCGATG
CTGCGGGTTG CGCTGCTGAA CCTCGGTTTC GGCGTGGGCT TCTACGCTGC CTTCATCTAT
GCCGTGAGCT ACATCAAGAA CATCGACCAT CTGCCGGACG CGACGGTTTT CAATCTGAAC
ACCTGGGCGA TGGCTCTGCT TCTGGTCCTG CTGCCCGTCG CAGCCTGGGC GTCCGACCGG
TTTGGCCGCA AGCCGGTGCT GGCCGCCGGC TTCGGCCTGC TCGCACTGGG CGCGATTCCC
CTGTTTCATC TGATCCACAC CGCCGACCCT CCCACCATCT TTCTGGGTGA AGCCGGCTTT
GCACTGACCA TCGGTTTGAT CAGCGGCGGC ATCGTCGCCA CCAACGTCGA GCTGGTGCCG
GCGGAGGTAC GCTGCACCGG TCTGGCCTTT GCCTACAATG CGGCGGTGGG GTGCTTCGGC
GGCAGCACAC CGCTGATCGC GGCGTGGCTG ATCGACCGGA CCGGCAACCC GCTCACGCCT
GCCTACTGGA TCGCGGCAAC GGCCACGGTG TCACTGATCA CGCTCGTCGC ATTCGTGCGC
GAGTTCCACT TTCACATGCC CCGCTAA
 
Protein sequence
MASLSPATQF RRIARSKPLP DKDSTIPMTS APPLRRRHDR IILAGLIGNV MEWYDFAVYG 
YFAVVIGKLF FPADDPAASL IASFGAFAAG FIVRPVGGLL FGRIGDRLGR QQALTWSVMA
MAVPTVLMAF LPTHASAGIA APVAIVLLRI VQGLSVGGEF TNSLVFLVEN APGERRAFTA
VWGSWGASAG ILLGSGAGDL LTHVLSEEQV LNWGWRLPFL AGGLVALTGY WLRQGLEPEL
PNAEHASPVR AVFARHKGAM LRVALLNLGF GVGFYAAFIY AVSYIKNIDH LPDATVFNLN
TWAMALLLVL LPVAAWASDR FGRKPVLAAG FGLLALGAIP LFHLIHTADP PTIFLGEAGF
ALTIGLISGG IVATNVELVP AEVRCTGLAF AYNAAVGCFG GSTPLIAAWL IDRTGNPLTP
AYWIAATATV SLITLVAFVR EFHFHMPR