Gene CPF_2363 details

Gene Information

Locus tag: CPF_2363
Symbol:
ID: 4202567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2627910
End bp: 2628941
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 29%
IMG OID: 638083228
Product: putative thiamin biosynthesis lipoprotein ApbE
Protein accession: YP_696786
Protein GI: 110799064
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
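
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2627910
END_BP = 2628941
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

A mismatch in either check would suggest 0-based coordinates, a partial CDS, or a spliced feature, none of which apply to this record as listed.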


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAAAA AGAGAGCAAT TGTTATTTTA TTAGTTTCTA TACTTTCTAT GGGATTAATA 
TCTTGTGATA ATTCAAAAAA AGTGGAAAGC AATAATAAAG AAGTTAATTC CTATGAAAAA
ACAGAAAAGA TTTTAGGTAC TGTAGTTAGT GGAGTAGCAT ATGGAGATAA TGCTAAGGAA
GCCTTAGAAA AAGCTTTTAA TAGAGCAAAG GACATTGAGA ATATGATGTC AGTAAATATA
GAGAATAGTG AGCTTAGCAA GGTAAATTCT GAAGCTTTTC ATAAAAGTGT TAAGTTATCA
GATGATTTAT ATTATGTCAT AGAAAAATCT ATATATTATG CTAATTTAAC TGATGGAGCT
TTAGATCCAA CTATAGGACA TGTAATTGAT TCTTGGGGAA TAGGAACAGA ACATGCTAAT
ATACCTGAAA AAACTTTAAT TGATAAGTAT AAAGATTTAA AAAATTATAA GAATATAGAA
TTAAATCCTC ATACTAAGGA AATAAGATTT TTAAATGAAA ATATAAAATT AGATTTAGGA
GCCATTGGAA AAGGATATGC TGGAGATGAA ATGAGAAAAG TTTTAAGGGA AGAAGGAATA
AATTCAGCAT TACTTAATTT AGGGGGAAAT GTTGTTGCCT TAGGAAATAA GATAAATAAT
GAAAATTGGA GCATAGGAAT AAGAAATCCT AAGGAAGAGA ATGAGATTTC AGCTTCTGTA
AAAATTAATG ATGAAGTTGT GGTAACATCA GGAAATTATG AAAGATATTT TATAAAAGAT
GGAGTTAGAT ATCATCATAT ATTAGATCCT AGCACTGCTT ATCCAGCTGA AAAGGGACTT
ATAAGTTCAA CAATAATCAC TAAAAATGGA ATAGATGCAG ATGCCTTATC AACAGCTACT
TATATTTTAG GGGCAGAAAA AGCTAAGAAA CTTATTGAAG GATTAGATGG GGTAGAGGCA
TTATTTATAA AAGATAATAT GGACTTCATA GAAACAAGCA ACTTGGATAA CAAAGGATTT
AGGGGGATGT AA
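
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be flattened before any composition statistic such as the GC content field can be recomputed. The sketch below shows one way this might be done. The helpers `clean_sequence` and `gc_percent` are hypothetical names, and how the source database rounds its reported percentage is not stated here, so the result for the full gene may differ slightly from the listed figure. Only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence as printed in this record.
    first_row = (
        "ATGTTGAAAA AGAGAGCAAT TGTTATTTTA "
        "TTAGTTTCTA TACTTTCTAT GGGATTAATA"
    )
    print(len(clean_sequence(first_row)))  # 60 bases in one full row
    print(round(gc_percent(first_row), 1))
```

Passing the complete gene sequence instead of `first_row` gives the value to compare against the GC content field.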
 
Protein sequence
MLKKRAIVIL LVSILSMGLI SCDNSKKVES NNKEVNSYEK TEKILGTVVS GVAYGDNAKE 
ALEKAFNRAK DIENMMSVNI ENSELSKVNS EAFHKSVKLS DDLYYVIEKS IYYANLTDGA
LDPTIGHVID SWGIGTEHAN IPEKTLIDKY KDLKNYKNIE LNPHTKEIRF LNENIKLDLG
AIGKGYAGDE MRKVLREEGI NSALLNLGGN VVALGNKINN ENWSIGIRNP KEENEISASV
KINDEVVVTS GNYERYFIKD GVRYHHILDP STAYPAEKGL ISSTIITKNG IDADALSTAT
YILGAEKAKK LIEGLDGVEA LFIKDNMDFI ETSNLDNKGF RGM