Gene CPF_1119 details


Gene Information

Locus tag: CPF_1119
Symbol: gngC
ID: 4203716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1278343
End bp: 1279605
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 32%
IMG OID: 638082000
Product: endo-beta-galactosidase, GlcNAc-alpha-1,4-Gal-releasing
Protein accession: YP_695565
Protein GI: 110800048
COG category:
COG ID:
TIGRFAM ID:
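The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1278343, 1279605)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).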


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.308798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTCT TTATGTTACT ATTGTTGCTA CCATTTACTA TTTCAAAAGC AAAGGATTTT 
CCAGCAAATC CAATTGAAAA AGCTGGATAT AAACTAGATT TTTCTGATGA GTTCAATGGT
CCTACATTAG ATAGAGAAAA ATGGACTGAT TATTATTTAC CACATTGGTG CAAGGATCCT
GAAAGTGCTA AGGCTAATTA TCGCTTTGAA AATGGATCAC TTGTTGAATA TATAACTGAA
GATCAGAAAC CATGGTGCCC AGAGCATGAT GGAACTGTTA GATCATCTGC CATAATGTCT
TTTGATAAAA GTTGGATACA TAATTTTAGT GGAACAACTG ATAATCATGA AAGAAATGAG
TGGAGAGGTT ATACAACTAA ATATGGATAC TTTGAAATTC GTGCTAAGTT ATCTAACACA
GGTGGTGGAG GCCATCAAGC TTGGTGGATG GTTGGTATGC AGGATGATAC TAATGATTGG
TTCAATTCAA AACAAACAGG TGAAATTGAT ATATTAGAAA CTTTCTTTAG TAAAAAAGAT
ACATGGAGAA TCGCTGCATA TGGATGGAAT GATCCAAACT TCCAAACATC TTGGACTATT
TCAGAAGATA AAGTTCCATC AGGAGATCCA ACTTCTGAAT ATCATATTTA TGCAATGGAA
TGGACTCCTA CTGCTTTGAA ATTTTATTAT GATAATGAAT TATTTAAGGT TATATATGGT
TCACCAGACT ATGAAATGGG GACAATTTTA AATATATACA CAGATGCAGG TTCAGGTGTT
CATAATGATG TTTGGCCTAA GGAATGGGCA ATTGATTATA TGAGAGTTTG GAAACCAGTA
GATGGATATA AAGAGAGTGA AAGTTTAAAT AATTACTTAA TAAGAAATAG ACAAACAGGA
AAATTCCTTT ATATTGAAGA AAATAATGAT AAAGTGTCTT ATGGGGACAT AACTTTAAAA
AATGAAAAAA ATGCAAAATG GAGTAAAGAA TATAGAGATG GATACACTTT ATTAAAGAAT
AATGAAACAG GAGAATATTT AAATATAGAA AACCAAACTG GATATATAGA ACATGGTAAG
GTTCCAAAAA CTTGGTGGAG TGCTCAATGG AGTGAAGTAC CAGTAGATGG ATATACAAGG
TTTGTTAACA GATGGAAGCC TAATATGTCA ATACATACAG AAAGTTATGA AGGCGTTTTA
CAGTATGGAA ATGTTCCAAA TACTTATTGG ACAAGTCAAT GGCAACTTAT TCCTGTAGAA
TAA
 
Protein sequence
MFVFMLLLLL PFTISKAKDF PANPIEKAGY KLDFSDEFNG PTLDREKWTD YYLPHWCKDP 
ESAKANYRFE NGSLVEYITE DQKPWCPEHD GTVRSSAIMS FDKSWIHNFS GTTDNHERNE
WRGYTTKYGY FEIRAKLSNT GGGGHQAWWM VGMQDDTNDW FNSKQTGEID ILETFFSKKD
TWRIAAYGWN DPNFQTSWTI SEDKVPSGDP TSEYHIYAME WTPTALKFYY DNELFKVIYG
SPDYEMGTIL NIYTDAGSGV HNDVWPKEWA IDYMRVWKPV DGYKESESLN NYLIRNRQTG
KFLYIEENND KVSYGDITLK NEKNAKWSKE YRDGYTLLKN NETGEYLNIE NQTGYIEHGK
VPKTWWSAQW SEVPVDGYTR FVNRWKPNMS IHTESYEGVL QYGNVPNTYW TSQWQLIPVE
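
The gene and protein sequences above are printed in space-separated groups of ten. As a rough sketch of how such a block might be processed, the code below strips the grouping, computes a GC fraction, and translates the DNA with the amino-acid assignments of translation table 11 (which are the same as those of the standard code; table 11 differs in its permitted start codons). All function names are hypothetical, and the sketch assumes the sequence is given in the coding orientation and starts with ATG, as it does here.

```python
from itertools import product

# Codon-to-residue assignments in NCBI base order T, C, A, G.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased;
    this is an assumption that holds for sequences starting with ATG.
    """
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGTTTGTCT TTATGTTACT ATTGTTGCTA CCATTTACTA TTTCAAAAGC AAAGGATTTT"
dna = clean_sequence(first_line)
print(len(dna), translate(dna), round(gc_fraction(dna), 2))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `translate` should reproduce the protein sequence listed above, provided the whole block is copied without omissions.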