Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1070 |
Symbol | |
ID | 4200945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1215484 |
End bp | 1216935 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638081951 |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_695516 |
Protein GI | 110798745 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.121451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAT ATTTTCAGAA ATTAGGTAAA TCTTTAATGC TACCAGTTGC TGCACTTCCA GTAGCAGGCA TATTAATGGG AATTGGATAT TGGCTAGACC CATCTGGATG GGGTGCTAAT AGTGTAGTAT CAGCATTCTT ATTAAAATCT GGTGGAGCTA TTATAGATAA CATGGCAATC TTATTTGCTA TTGGGGTAGC TGTTGGTATG TCAGATGATA ATGATGGAGC AGCAGGTTTA GCAGGTTTAG TATCATGGCT TATGATAACT ACATTATTAT CTAGTGCAGT TGTTGCAATG ATGCAAGGAG TAGATGTTGA AGCTGTTAAC CCAGCCTTTG GAAAGATACA AAACCAATTT ATAGGTATCG TAGCCGGTTT AATAGGAGCT GGTTGTTATA ATAGGTTTAA AAATACTAAA TTACCAGATT TCTTAGGATT CTTTAGTGGA AGAAGATGTG TTGCTATAGT AACAGCACTT GCTTCAATAG TAGCTTCTTT AGTTTTATAC TTTGTATATC CTGTAATATA TAGCGCATTA GTTTCATTTG GTGAATCAAT AATAGCTATG GGACCAATAG GAGCTGGTGT TTATGGATTC TTCAATAGAT TATTAATACC ACTTGGTTTA CACCATGCTT TAAACTCAGT ATTCTGGTTT GACGTTGCTG GTATAAATGA TATAGGAAAC TTCTGGGCTG GAACAGGTAC TTTAGGACAA ACTGGTATGT ATATGTCAGG TTTCTTCCCA GTAATGATGT TTGGTTTACC TGCTGCTGCT TTTGCAATGT ATCGTTGTGC AAAAACTAAT AAAAAGAAGG CTGCTGCTGG TATATTATTA GCTGCTGCTT TATCATCATT CGTTACAGGA GTAACTGAAC CATTAGAATT TGCATTCATG TTCTTAGCAC CTGCTTTATA TGTAGCTCAT GCTTTATTAA CAGGAATATC TATGGCAGTT GTTGCTGCTT TACCAGTAAG AGCAGGATTT AACTTTAGTG CTGGTTTAGT AGACTGGATA TTAAGTTTTG CATCACCAAT GGCATTAAAC CCATTATACT TATTAGGTAT AGGTTTAGTA GTGGGAGTTA TATACTACTT AGTATTCGTA GTTATGATAA AGAAATTTGA CTTAAAGACT CCAGGTAGAG AAGATGATGA GGATGATGAT TCAAATGCAG TTTTAGCAAA TAATAACTAC ACTGAAGTTG CAAAAATCAT CTTAGAAGGA CTAGGTGGAC CTTCTAATAT TACTTCAATT GATAACTGTA TAACTAGATT AAGATTAGAA GTAAAAGACA GCACACTAGT TAATGAGAAA AAGATAAAAT CATCTGGTGT TTCAGGAGTT ATAAGACCTG GAAAAACAAG TGTTCAAGTT ATAGTTGGAA CACAAGTACA ATTCGTAGCT GACGAGTTTA AGAAACTTTG TAAGTCTAAA AATTACGTAT AA
|
Protein sequence | MMKYFQKLGK SLMLPVAALP VAGILMGIGY WLDPSGWGAN SVVSAFLLKS GGAIIDNMAI LFAIGVAVGM SDDNDGAAGL AGLVSWLMIT TLLSSAVVAM MQGVDVEAVN PAFGKIQNQF IGIVAGLIGA GCYNRFKNTK LPDFLGFFSG RRCVAIVTAL ASIVASLVLY FVYPVIYSAL VSFGESIIAM GPIGAGVYGF FNRLLIPLGL HHALNSVFWF DVAGINDIGN FWAGTGTLGQ TGMYMSGFFP VMMFGLPAAA FAMYRCAKTN KKKAAAGILL AAALSSFVTG VTEPLEFAFM FLAPALYVAH ALLTGISMAV VAALPVRAGF NFSAGLVDWI LSFASPMALN PLYLLGIGLV VGVIYYLVFV VMIKKFDLKT PGREDDEDDD SNAVLANNNY TEVAKIILEG LGGPSNITSI DNCITRLRLE VKDSTLVNEK KIKSSGVSGV IRPGKTSVQV IVGTQVQFVA DEFKKLCKSK NYV
|
| |