Gene CPF_0891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0891 
Symbol 
ID4203023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1056901 
End bp1058268 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content34% 
IMG OID638081773 
Productethanolamine ammonia-lyase, large subunit 
Protein accessionYP_695340 
Protein GI110801427 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4303] Ethanolamine ammonia-lyase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTAA AAACAAAATT ATTTGGAAAA GTCTATGCTT TCAAATCTTT AAATGAGGTT 
ATGGCTAAGG CAAACGAAGA GAAATCAGGA GATAGATTAG CTGGATTAGC AGCAGAGTCT
TCAGAAGAAA GAGTAGCAGC AAAGGTTGTA TTATCAAATA TAACTTTAGA GGATTTAAGA
AATAACCCAG CAGTTCCTTA TGAAATAGAT GAGGTAACTA GAATAATTCA AGATGATGTA
AATGAAAAAA TATACAATGA AATAAAACAT TGGACAGTAT CTGAATTTAG AGAGTGGATA
TTAGATGAAA ATACAACAGG TGCTGATATT AGAAGAATTT CAAGAGGTTT AACTTCTGAA
ATGGTAGCAG CTGTAGCAAA ATTAATGTCT AATATGGACT TAATATATGG AGCAAGAAAG
ATAAAGGTAA CAGCTCACTG TAACACAACA ATAGGTGAAA AGGGAACTTT ATCTGCAAGA
CTTCAACCAA ACCATCCAAC AGATGATCCA GATGGAATAA TGGCTTCATT ATTAGAAGGA
TTAACTTTTG GTGTTGGAGA TGCAGTTTTA GGATTAAACC CAGTTGATGA CTCTGTTGAG
AGTGTTACTA AAGTATTAAA GAGATTTGAT GAAATAAAAA GAAAATTTAA AATACCAACT
CAAACTTGTG TACTAGCTCA CGTAACAACT CAAATGGAAG CTATAAGACA AGGGGCGCCT
ACAGACCTAA TATTCCAATC AATAGCAGGT TCTGAAAAGG GAAATGAAGC TTTTGGATTT
AATGCAGCAA CTATAGAAGA AGCTAGACAA TTAGCTTTAA AACAAGGAAC GGCTACAGGA
CCAAATGTAA TGTACTTTGA AACAGGACAA GGTTCAGAGC TTTCATCAGA TGCTCACCAT
GGAGTTGACC AAGTAACTAT GGAAGCTAGA TGTTATGGAT TCGCTAAGAG ATTCCAACCA
TTCTTAGTTA ACACAGTTGT TGGATTCATA GGACCAGAGT ATTTATATGA TTCAAAACAA
GTTATAAGAG CAGGTCTTGA AGACCACTTC ATGGGTAAAT TAACAGGAAT ACCAATGGGA
TGTGATGCAT GTTATACAAA CCACATGAAA GCAGATCAAA ATGATATAGA AAACTTAGCT
GTATTATTAA CAACAGCAGG ATGTACTTAT TTCATGGGAA TTCCACATGG AGATGACGTA
ATGCTTAACT ATCAAACTAC AGGATACCAT GAAACAGCAG CTTTAAGAGA AATGTTTGGA
TTAACAGCTA TTAAAGAGTT CCAAGATTGG TTAGTTGAAA TGGGATTCGT AGACGAAAAT
GGAAAGCTTA CTAAAAAAGC AGGAGATGCA TCTGTACTTT TAGGATAG
 
Protein sequence
MILKTKLFGK VYAFKSLNEV MAKANEEKSG DRLAGLAAES SEERVAAKVV LSNITLEDLR 
NNPAVPYEID EVTRIIQDDV NEKIYNEIKH WTVSEFREWI LDENTTGADI RRISRGLTSE
MVAAVAKLMS NMDLIYGARK IKVTAHCNTT IGEKGTLSAR LQPNHPTDDP DGIMASLLEG
LTFGVGDAVL GLNPVDDSVE SVTKVLKRFD EIKRKFKIPT QTCVLAHVTT QMEAIRQGAP
TDLIFQSIAG SEKGNEAFGF NAATIEEARQ LALKQGTATG PNVMYFETGQ GSELSSDAHH
GVDQVTMEAR CYGFAKRFQP FLVNTVVGFI GPEYLYDSKQ VIRAGLEDHF MGKLTGIPMG
CDACYTNHMK ADQNDIENLA VLLTTAGCTY FMGIPHGDDV MLNYQTTGYH ETAALREMFG
LTAIKEFQDW LVEMGFVDEN GKLTKKAGDA SVLLG