Gene CPF_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1070 
Symbol 
ID4200945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1215484 
End bp1216935 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content35% 
IMG OID638081951 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_695516 
Protein GI110798745 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAT ATTTTCAGAA ATTAGGTAAA TCTTTAATGC TACCAGTTGC TGCACTTCCA 
GTAGCAGGCA TATTAATGGG AATTGGATAT TGGCTAGACC CATCTGGATG GGGTGCTAAT
AGTGTAGTAT CAGCATTCTT ATTAAAATCT GGTGGAGCTA TTATAGATAA CATGGCAATC
TTATTTGCTA TTGGGGTAGC TGTTGGTATG TCAGATGATA ATGATGGAGC AGCAGGTTTA
GCAGGTTTAG TATCATGGCT TATGATAACT ACATTATTAT CTAGTGCAGT TGTTGCAATG
ATGCAAGGAG TAGATGTTGA AGCTGTTAAC CCAGCCTTTG GAAAGATACA AAACCAATTT
ATAGGTATCG TAGCCGGTTT AATAGGAGCT GGTTGTTATA ATAGGTTTAA AAATACTAAA
TTACCAGATT TCTTAGGATT CTTTAGTGGA AGAAGATGTG TTGCTATAGT AACAGCACTT
GCTTCAATAG TAGCTTCTTT AGTTTTATAC TTTGTATATC CTGTAATATA TAGCGCATTA
GTTTCATTTG GTGAATCAAT AATAGCTATG GGACCAATAG GAGCTGGTGT TTATGGATTC
TTCAATAGAT TATTAATACC ACTTGGTTTA CACCATGCTT TAAACTCAGT ATTCTGGTTT
GACGTTGCTG GTATAAATGA TATAGGAAAC TTCTGGGCTG GAACAGGTAC TTTAGGACAA
ACTGGTATGT ATATGTCAGG TTTCTTCCCA GTAATGATGT TTGGTTTACC TGCTGCTGCT
TTTGCAATGT ATCGTTGTGC AAAAACTAAT AAAAAGAAGG CTGCTGCTGG TATATTATTA
GCTGCTGCTT TATCATCATT CGTTACAGGA GTAACTGAAC CATTAGAATT TGCATTCATG
TTCTTAGCAC CTGCTTTATA TGTAGCTCAT GCTTTATTAA CAGGAATATC TATGGCAGTT
GTTGCTGCTT TACCAGTAAG AGCAGGATTT AACTTTAGTG CTGGTTTAGT AGACTGGATA
TTAAGTTTTG CATCACCAAT GGCATTAAAC CCATTATACT TATTAGGTAT AGGTTTAGTA
GTGGGAGTTA TATACTACTT AGTATTCGTA GTTATGATAA AGAAATTTGA CTTAAAGACT
CCAGGTAGAG AAGATGATGA GGATGATGAT TCAAATGCAG TTTTAGCAAA TAATAACTAC
ACTGAAGTTG CAAAAATCAT CTTAGAAGGA CTAGGTGGAC CTTCTAATAT TACTTCAATT
GATAACTGTA TAACTAGATT AAGATTAGAA GTAAAAGACA GCACACTAGT TAATGAGAAA
AAGATAAAAT CATCTGGTGT TTCAGGAGTT ATAAGACCTG GAAAAACAAG TGTTCAAGTT
ATAGTTGGAA CACAAGTACA ATTCGTAGCT GACGAGTTTA AGAAACTTTG TAAGTCTAAA
AATTACGTAT AA
 
Protein sequence
MMKYFQKLGK SLMLPVAALP VAGILMGIGY WLDPSGWGAN SVVSAFLLKS GGAIIDNMAI 
LFAIGVAVGM SDDNDGAAGL AGLVSWLMIT TLLSSAVVAM MQGVDVEAVN PAFGKIQNQF
IGIVAGLIGA GCYNRFKNTK LPDFLGFFSG RRCVAIVTAL ASIVASLVLY FVYPVIYSAL
VSFGESIIAM GPIGAGVYGF FNRLLIPLGL HHALNSVFWF DVAGINDIGN FWAGTGTLGQ
TGMYMSGFFP VMMFGLPAAA FAMYRCAKTN KKKAAAGILL AAALSSFVTG VTEPLEFAFM
FLAPALYVAH ALLTGISMAV VAALPVRAGF NFSAGLVDWI LSFASPMALN PLYLLGIGLV
VGVIYYLVFV VMIKKFDLKT PGREDDEDDD SNAVLANNNY TEVAKIILEG LGGPSNITSI
DNCITRLRLE VKDSTLVNEK KIKSSGVSGV IRPGKTSVQV IVGTQVQFVA DEFKKLCKSK
NYV