Gene CPF_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1550 
SymbolmglC 
ID4202621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1769053 
End bp1770081 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content33% 
IMG OID638082428 
Productbeta-methylgalactoside transporter inner membrane component 
Protein accessionYP_695993 
Protein GI110799291 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4211] ABC-type glucose/galactose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCA AATCAAAGAA AAAGTTTAAT TTAAATAGCC AATGGTTAAT GAATAATGCT 
ATTTATATAG TTTTAGTGGC CTTATTAATA GGAATTTGTA TTATTTCACC AGATTTTTTA
TCACTAAAAA ACTTTATTAA TATATTAAGC CAATCCTCTT CTCGTATAAT AATAGCTCTT
GGAGTTGGAG GAATACTTTT AACTGAAGGT ACTGACCTTT CTGCTGGTAG AACAGTTGGA
CTTGCTGCCG TAGTATCAGC TTCTTTACTT CAAGCTGGAG ATTATGCATA TAAAATGTAT
CCTAATTTAC CTGAATTACC TATATTCATT CCAATTTTAA TAGCAATGGC TGTTTGTGGA
ATCGTTGGTC TTGTAAATGG ATTAGTTGTT TCTAAATTTA ATGTTCCTCC ATTTATAGCT
ACTCTAGGAA TGATGACAGG AATATATGGA CTTAACTCAA TATACTTTGA TAGACCTCCA
TATGGAGCTA TGCCAATAGG TGGTCTTAGC CAATCCTTTA GTAATTTTAC ACTTGGTTCC
ATTCCTATAT ATGGAAATAT AAAAATACCT TATTTAGTTA TATATGCAAT TATTGTAATA
GCTGTAATTT GGACTTTATG GAATAAAACT AAATTTGGTA AAAACCTTTA TGCCATAGGT
GGTAACAGAG AAGCTGCTGT GGTTTCAGGT GTAAATGTTG TTAGAACACT TTTATTAGTT
TATATGTTAG CTGGAGTTCT TTATGGTTTT GCAGGTACTC TAGAAGCTGG TCGTGTTGGT
AGTGCTACTG CTAGTACTGG TGAAATGTAT GAACTAGATG CCATAGCTTC CTGTGTTGTT
GGTGGAGTTT CCACTGCTGG TGGTGTTGGT ACTGTTCCTG GAATAGTAAC TGGTGTTTTA
ATATTCCAAG TTATAAACTA TGGTCTAGCT TTTATAGGTG TTAGCCCTTA TTTACAATTC
GTTATAAAAG GTTTAATTAT AGTTCTAGCT GTAGCTCTTG ATATGAGAAA ATACATGAAA
AAGAACTAA
 
Protein sequence
MQTKSKKKFN LNSQWLMNNA IYIVLVALLI GICIISPDFL SLKNFINILS QSSSRIIIAL 
GVGGILLTEG TDLSAGRTVG LAAVVSASLL QAGDYAYKMY PNLPELPIFI PILIAMAVCG
IVGLVNGLVV SKFNVPPFIA TLGMMTGIYG LNSIYFDRPP YGAMPIGGLS QSFSNFTLGS
IPIYGNIKIP YLVIYAIIVI AVIWTLWNKT KFGKNLYAIG GNREAAVVSG VNVVRTLLLV
YMLAGVLYGF AGTLEAGRVG SATASTGEMY ELDAIASCVV GGVSTAGGVG TVPGIVTGVL
IFQVINYGLA FIGVSPYLQF VIKGLIIVLA VALDMRKYMK KN