Gene CPF_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1549 
SymbolmglA 
ID4202214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1767534 
End bp1769081 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content32% 
IMG OID638082427 
Productgalactose/methyl galaxtoside transporter ATP-binding protein 
Protein accessionYP_695992 
Protein GI110798692 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGACT CTTCAAACCT GTTAGAAATG CGAAATATCT CTAAAGAATT CCCAGGGGTT 
AAAGCCTTAG ATAATGTAAC CTTAAAAGTA AAAAAAGGTT CTGTACATGC ATTAATGGGA
GAAAATGGTG CTGGTAAATC AACCTTAATG AAATGTCTAT TTGGTATATA TCACCCTAAT
TCAGGAGAAA TTTTTATTTC TGGCCAAAAG GTACAATTTA AAAATTCAAA ACACGCCCTA
GATAATGGAG TATCTATGGT TCACCAAGAA CTAAATCAAG TTAGAGAAAG AAATGTTATG
GATAATCTTT GGCTTGGTAG ATATCCTAAA AAAGGACTTT TTATAGATGA AAAGAAAATG
TATGATGAAA CAGAAAAAAT CTTTAAAGAT CTAGATATAA ATGTTAATCC TCGTGATAAG
GTTTCTACCC TATCTGTTTC TCAAATGCAA ATGGTTGAAA TAGCAAAGGC TGTTTCGTAT
AACTCTAAAA TAATAGTAAT GGACGAGCCT ACTTCCTCTT TAACAGAAAA AGAAGTAAGT
CATCTATTTA AAATAATAAA TAAACTTAGA AAGCAAGGAA TAAGTATAAT TTATATCTCT
CATAAGATGG AAGAAATCTT AGAGATCTCT GATGAAGTTA CCATAATGAG AGATGGTAAA
TGGATTGCTA CTGAAAAAGC TTCAGATCTT ACTATGGATT TAATAATAAA ACTTATGGTT
GGACGTGAAC TTACTGATAG ATTCCCTAAA AAGGATCATA TTCCTAAAGA AACTATTTTA
GAAGTAAATA ATCTTAGTGA TGCCAAAAAT GAATTAAAGA ATGTTTCCTT TAAACTTAGA
AAGGGAGAAA TTTTAGGAAT TGCAGGTCTT GTTGGTGCTA AAAGAACTGA GACCTTAGAA
ACCTTATTTG GCCTTAGAGA AAAGGGCTCT GGAGATATTA TTTTACATGG CAAAAAAGTT
GATAACAGTA AGCCTTTTAA GGCTATGCAA AATGGTTTTG CCCTTGTTAC TGAAGAAAGA
AGACAAACTG GAATCTTTGG AAAATTACCT ATAGATTTTA ATTCTATAAT AGCTAATATA
GATAGTTATA AAACATCAAC CGGTCTTTTA GCAAATGGAA GAATCTCTAA AGATACTCAA
TGGGTTATAG ATTCAATGAA AGTAAAAACT CCAAGTCAAA AAACTCTAAT CGGTAGCCTA
TCTGGTGGTA ATCAACAAAA GATAGTAATT GGAAAATGGC TTCTTAGAAA ACCTGAAATA
CTACTTCTAG ATGAGCCTAC TAGAGGTATA GATGTTGGTG CTAAATTCGA AATATACCAA
CTTATAAATG AACTTGCTAA AGAAGACAAA GGAATAATTA TGGTTTCTTC TGAAATGCCT
GAACTTTTAG GTGTATGTGA CAGAATACTA GTCATGAGTA ATGGTAGGGT TTCTGGCATA
GTTAATGCTA ATGAGACTAC CCAAGAGGAA ATTATGCATC TATCTGCAAA ATATCTATCA
GTAACAGGAG GAGTTAACAA TGCAAACCAA ATCAAAGAAA AAGTTTAA
 
Protein sequence
MKDSSNLLEM RNISKEFPGV KALDNVTLKV KKGSVHALMG ENGAGKSTLM KCLFGIYHPN 
SGEIFISGQK VQFKNSKHAL DNGVSMVHQE LNQVRERNVM DNLWLGRYPK KGLFIDEKKM
YDETEKIFKD LDINVNPRDK VSTLSVSQMQ MVEIAKAVSY NSKIIVMDEP TSSLTEKEVS
HLFKIINKLR KQGISIIYIS HKMEEILEIS DEVTIMRDGK WIATEKASDL TMDLIIKLMV
GRELTDRFPK KDHIPKETIL EVNNLSDAKN ELKNVSFKLR KGEILGIAGL VGAKRTETLE
TLFGLREKGS GDIILHGKKV DNSKPFKAMQ NGFALVTEER RQTGIFGKLP IDFNSIIANI
DSYKTSTGLL ANGRISKDTQ WVIDSMKVKT PSQKTLIGSL SGGNQQKIVI GKWLLRKPEI
LLLDEPTRGI DVGAKFEIYQ LINELAKEDK GIIMVSSEMP ELLGVCDRIL VMSNGRVSGI
VNANETTQEE IMHLSAKYLS VTGGVNNANQ IKEKV