Gene CPR_1343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1343 
SymbolmglC 
ID4205964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1515105 
End bp1516133 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content33% 
IMG OID642565897 
Productbeta-methylgalactoside transporter inner membrane component 
Protein accessionYP_698663 
Protein GI110803947 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4211] ABC-type glucose/galactose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.854381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCA AATCAAAGAA AAAGTTTAAT TTAAATAGCC AGTGGTTAAT GAATAATGCT 
ATATATATAG TTTTGGTAGT TTTATTAATA GGAATTTGTA TTATTTCACT AGACTTTTTA
TCATTAAAAA ACTTTATTAA TATATTAAGC CAATCCTCTT CTCGTATAAT AATAGCCCTT
GGGGTTGGAG GAATACTTTT AACTGAAGGT ACTGACCTTT CCGCTGGTAG AACAGTTGGA
CTTGCTGCCG TAGTATCAGC TTCATTACTT CAAGCTGGTG ATTATGCATA TAAAATGTAT
CCTAATTTAC CTGAATTGCC TATATTCATT CCTATTTTAA TAGCAATGGC TGTTTGTGGA
ATCGTTGGTC TTGTAAATGG ACTAGTTGTT TCTAAATTTA ATGTTCCTCC ATTTATAGCT
ACTCTAGGAA TGATGACAGG AATATATGGA CTTAACTCAA TATACTTTGA TAGACCTCCA
TATGGAGCTA TGCCAATAGG TGGTCTTAGT CGATCCTTTA GCAATTTTAC ACTTGGATCA
ATCCCCATAT ATGGAAATAT AAAAATACCT TATTTAGTTA TATATGCAAT TATTGTAATA
GCTGTAATTT GGACTTTATG GAATAAAACT AAATTTGGTA AAAACCTTTA TGCTATAGGT
GGTAACAGAG AAGCTGCTGT GGTTTCAGGT GTAAATGTTG TTAGAACACT TTTATTAGTT
TATATGTTAG CTGGAGTTCT TTATGGTTTT GCAGGTGCCC TAGAAGCTGG TCGTGTTGGT
AGTGCTACTG CTAGTACTGG TGAAATGTAT GAATTAGATG CCATAGCTTC CTGTGTTGTT
GGTGGAGTTT CCACTGCTGG TGGTGTTGGT ACTGTTCCTG GAATAGTAAC TGGTGTTTTA
ATATTCCAAG TTATAAACTA TGGCCTAGCT TTCATAGGTG TTAGCCCTTA TTTACAATTC
GTTATAAAAG GTTTAATTAT AGTTCTAGCT GTAGCTCTTG ATATGAGAAA ATACATGAAA
AAGAACTAA
 
Protein sequence
MQTKSKKKFN LNSQWLMNNA IYIVLVVLLI GICIISLDFL SLKNFINILS QSSSRIIIAL 
GVGGILLTEG TDLSAGRTVG LAAVVSASLL QAGDYAYKMY PNLPELPIFI PILIAMAVCG
IVGLVNGLVV SKFNVPPFIA TLGMMTGIYG LNSIYFDRPP YGAMPIGGLS RSFSNFTLGS
IPIYGNIKIP YLVIYAIIVI AVIWTLWNKT KFGKNLYAIG GNREAAVVSG VNVVRTLLLV
YMLAGVLYGF AGALEAGRVG SATASTGEMY ELDAIASCVV GGVSTAGGVG TVPGIVTGVL
IFQVINYGLA FIGVSPYLQF VIKGLIIVLA VALDMRKYMK KN