Gene CPF_1623 details

Gene Information

Locus tag: CPF_1623
Symbol:
ID: 4203639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1840499
End bp: 1841575
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 25%
IMG OID: 638082501
Product: aminotransferase family protein
Protein accession: YP_696066
Protein GI: 110799374
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
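
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1840499
end_bp = 1841575

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1077
print(protein_length)  # 358
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.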


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0121339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATTA ATCATGGTGC TAATTTATTT GATTTATCAA ATGAATTAGG TATAAATAAA 
AAAGATATTA AAGATTTTAG TTCAAATATA AATCCCTTTG GAGCATCTAA AAAAGCTAAA
GATGCAATTT TAAATAATAT TGATATGGTT TCTATATACC CAGATCCTAA ATATAAAGAT
TTAAAGGAGT CAATCTCTCA ATATTGTCAT TGCAAAAAAG AAAATATTAT AGTTGGTAGC
GGAGCTACAG AATTAATATC CTCTTTTATA AGTGTTATAA ACCCTAAAAA AGCTTTATTA
CTTTCTCCAT CTTATTCAGA ATATGAAAGT GAACTTGAAA AGATAAATTG TGAAATAACT
AAGTTTTTTT CCAAAGAAGA AGATAACTTT AAAATAGATG TTAACAAATT AATAGATAGC
ATAAACTCTT CAAAGTTTGA TTTAGTTATA ATTTGTAATC CTAATAATCC TACTGGATTC
GCCTTTTCAA AGGATGAAAT TTCTTTATTA CTAAAAAATA CTTCATCAAT ATTCATGGTT
GATGAAACTT ATGTTGAGTT TACAGAGCCT GAAATTTACT CCTCTACTCC ACTAGTAGAT
ATATTTAATA ATCTATTTGT AATTAGAGGA ACTTCTAAAT TTTTCTCAAC TCCAGGTATA
AGATTAGGTT ATGGACTTAT TTCTAATAAA GAAATTAAAA AGTCAATGGT TGAAAAACTT
GATTTGTGGA ATATAAATAT CTTTGCTACA ACAATGGGAG AAATTATGTT TAAGGATAAA
GAGTATATTC TTTCAAATAC CTCTAAGTTA AAAGAAGAAA GAGATTATTT ATTTAGAGAA
TTAAGTTCAA TAAAGGACTT AAAGGTATAT GAAAGTTACA GTAACTTTAT ACTTTGTAAA
ATTAGATCTA AAAAATTCAC TGCTACTGAA CTTTATAATA AACTTTTAGA AAAAGGATTG
ATAATAAGAA ACTGTTCTTC TTTTGAAGGT TTAAATGAAT ATTTCTTTAG GGTTTGTGTC
TTAAAACCTG AAGATAATAA ACTTCTTATA GAAAATCTTA AAAATTTATT TTTATAA
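
A GC content figure like the one in the record can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not the portal's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAGTATTA ATCATGGTGC TAATTTATTT GATTTATCAA ATGAATTAGG TATAAATAAA"

print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the rounded percentage in the record.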
 
Protein sequence
MSINHGANLF DLSNELGINK KDIKDFSSNI NPFGASKKAK DAILNNIDMV SIYPDPKYKD 
LKESISQYCH CKKENIIVGS GATELISSFI SVINPKKALL LSPSYSEYES ELEKINCEIT
KFFSKEEDNF KIDVNKLIDS INSSKFDLVI ICNPNNPTGF AFSKDEISLL LKNTSSIFMV
DETYVEFTEP EIYSSTPLVD IFNNLFVIRG TSKFFSTPGI RLGYGLISNK EIKKSMVEKL
DLWNINIFAT TMGEIMFKDK EYILSNTSKL KEERDYLFRE LSSIKDLKVY ESYSNFILCK
IRSKKFTATE LYNKLLEKGL IIRNCSSFEG LNEYFFRVCV LKPEDNKLLI ENLKNLFL
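
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration: it builds the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted), and it does not force an alternative start codon to methionine. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG ordering of first, second, third base.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored; a codon with characters other than
    A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAGTATTA ATCATGGTGC TAATTTATTT GATTTATCAA ATGAATTAGG TATAAATAAA"
print(translate(first_line))  # MSINHGANLFDLSNELGINK
```

The output matches the first 20 residues of the protein sequence listed above.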