Gene CPF_1576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1576 
Symbol 
ID4201424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1795557 
End bp1798811 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content30% 
IMG OID638082454 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_696019 
Protein GI110800747 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.929856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCTA ATGTAAAAAT AGGAGCAAAT ACATCTGACT TTCAAAAGCA AATGAAGGAT 
ATGGTAAGAG AGCTTAAATC TCTTGGCAGT ACATTTAATT TAGCAAGTAC TCAGGCTAAA
TTATTTGGTA ATGCTAATGA ACAATTAAAA GTAAAACAGG CTGAGCTTAC TGAAAAAATG
AAACTTCAAA ATAGAATGAT TAATTTACAA AAAGAAGCAA TTAATAAGTT GACTAATGAT
GTTCAGAAGC AAAAAGAAAA AAGAGAAGAA TTATCTAAAA AAATAGAAGA AGTAACTAAA
AAATATAAAG AAAGTTCTGA AGCTACTGGA AAGAATAGCG AGGAAACAAA AAAGTTAGGA
AAATCTTTAG CTGATTTAAA AGAAGAATAT GCTAGGAATG AAAGAGCAAT AGATTCATCT
AATAGAAAAA TAGATACTGC AAATATGAAA ATGAATAAAT CTAAGACAGA GCTATTAGAA
AATAAAAAAG CTTTAGAAGA GGTTGATAAG AAATTAAAAG ATATCAATTT AGATAAATTC
TCTCAAAAAA TGGACAAGGT AAGCAATGCT ACAGGGAAGG CAGCTAGTGC TTTAAAACCA
GCAGCAATAG CTGTTACTGG ATTTGGTGTT GCTGCTGCAG TGACAGAAAT GAAGTTTCAG
GATGGTATAG CAAATATAAA TACTCTTTTA GATGACCAAA GTCATTTAGA AGGATATAAA
AATAAAATAA TGGAAGTATC AAACCAAACA GGAATTGATT TGAAGATTGT AACAGATGGT
ATGTATCAGG CTATTTCATC AATTGGAGAT GGAGGAGCAG AAACAGAGAA GATATTTGAT
ACTATGGCTA AAAGTGCTAA GGCTGGTGGA GCAGAGGTTA AAGATGCAGT TGCTTTAATT
AGTGCAGGAA TGAAAGGATA TAATCAAGTT AATGATGAAA CAGCAAAGAA GATAAGTGAT
TTAGCCTTCC AAACAGCTAA GTTGGGAGTA ACAACTTTTC CAGAAATGGC TTCAAGCATG
CAACCATTAT TCCCACTAGC AAGCAATTTA AATTTATCAA TGACAGATTT ATTTACTAAT
ATGGCTACAT TAACAGGGGT TACAGGAAAC ACTTCAGAAG TATGTACCCA GTTAAAAGCA
GTATTTAGTA ACTTAATTAA ACCAACAGCA GATATGCAAA AATTAATGGA GAAATATGGC
TTCCAAAATG GTCAAGCAAT GTTAAAAAGT GAGGGCTTAA TTGGTACTTT AAAAATATTA
CAAAAAGAAA CTGGTGGACA ATCAGATAAA ATGGGAAAAC TTTTTAGTAG TACTGAAGGA
CTAACAGCAA TTACAGCATT AACAAGTTCA CAGTTTGATA CGTTGGCAGA TAAGTCTAAG
AAAATGAATG AGGCTATAGG AACAACTGAT TCAGCTCTTG CAAAAATAAA TAATACTACT
GGAAATGATT TAAGAACATC TTTAAACTTA GCTAAAAATA GTCTAGTTGG ATTTGGTGAA
GTATTAGCAC CATTTATTTC TTTAGCTGCT AAGGGAATAG GTGGTATAGC TAAAGCTTTT
GGTGGATTAA GTGAAGGGCA AAAAAAGGCA GTAGTGGGAA TAGGAGCTTT TGTAGTTGGT
TCTTATGGAG CTTTAGCAGT AACAACTAAA GTTACAGGAA CAATAAGAAA TGCCATTAAA
GATTATAAAG CATTTAGAGA TGTTATGGAA AAATTAAAAA TTGCTACAAA GTTACAAACA
GCAGCACAGA AAGCTTTAAA TTTTGTTACT GAAATATCTC CAGTAGGAAA AATTATATTA
GTCGTTGGAT TAGTAGTAGG AGCTTTAACT TATTTATATA ATCATTGCGA GTGGTTTAGA
AATGGAGTTA ATAAAATATT TAGTGGACTT ATAAAGTTTT TTACAGAAAC AGTACCTAAT
AAATTAAAGC AGTTAATAAA TTTCTTTAAA AATGATTGGA AGGAAATTTT GTTGCTTATA
GTTAATCCTT TTGCTGGAGC TTTTGCATTA GCTTACAAGC ATTGTGAAGG ATTTAGGAAT
GGAGTGGACA AATTATTTTC AAATATTAAA GAGTTTTTTA CTAAAAGTAT TCCTGATTTC
TTTAAGTGGT TAATATCTAA ACTTACACAA TTTAAAACAG ACTTCATAAA TAAAATAAAA
GAGTTTGGTA CTATAATAAA AACTAGAATA AAAAATTATA TAGAAGAAGT AAAATTTATA
TTTAGTAATT TACCTAAACT TATGGGGATA TTGATAGGAA AAATTGCAGG AGAAATTTAT
AAGGGATTTT TAAACATAAA AATATTTATC ACTAAAACTA TTCCAGATGC AATAAATTCT
ATAAAGCAAT GGTTTGCACA GCTTCCTGAA GCAATAGGGA AACAATTACT TGATTCATAT
ATGAAAGTAA AGACATGGGG AAATAATTTA TATATTTCAG CAAAAAAGAC AGGTAAGGAT
TTTATATATG GAGTGATAGA TTATGTTAAG GAACTACCAC ATAAAATTTG GAATAAGATT
ACTGAAGCTT ATGATGGAGT TACTACATGG GGAAATAATA TGTACCAAGA AGCTAAGAAA
GTGGGAAAGG AATTTGTTGA TTCTATAGTA GATCATGTTA AAGAACTACC AACAAGATTT
AAAAATTGGT TAAAAGAATC TTGGAATAAG GTAAGTGCAT GGGGAGAAGA TTTAAAGACT
GCTGGTAAAG AGAGTGGTAA AAAGTTAGTT GATTCTATAG TAGATACAGT AAAAGCTATA
CCAGGACAAA TGAAAGAGAT AGGTAAAAAT ATAGTTCATG GGATTTGGGA TGGTATAACA
GGTGCTATTG GATGGATAAA AAGTAAGATA AAACAATTTT GTGATGGAAT TGTAGAAGGA
TTTAAAAGTT CTTTAGATAT ACATTCTCCA TCTAGGGTGT TAAGAGATCA GGTTGGTAAA
TTTATGGCTC AAGGTGTTGG AGTAGGTTTT GTCAATGAAA TGGAAGATAT AAATTTAGAT
ATAAAGCAAA GTTTGGATAG AACTATAAAT ACTAATATAG TACCTTCTAT ATCAAATGTT
GACTTAGAAA AAATAAATAC AAAACTTAAT AATACAAACA ATAATAATAT AGTAGTTGTA
ATTGAAAATG TAACTAATTT AGATGGTGAA GTAATCAGTA CTAAAGTATA TAAGAAAGTA
GCTAAACAAA TGAAAAGTGA TGAAAATAGT TATAGAGTTA CTAAAGGAAA GAAGGGAGGT
CGAATATGTG CATAA
 
Protein sequence
MGANVKIGAN TSDFQKQMKD MVRELKSLGS TFNLASTQAK LFGNANEQLK VKQAELTEKM 
KLQNRMINLQ KEAINKLTND VQKQKEKREE LSKKIEEVTK KYKESSEATG KNSEETKKLG
KSLADLKEEY ARNERAIDSS NRKIDTANMK MNKSKTELLE NKKALEEVDK KLKDINLDKF
SQKMDKVSNA TGKAASALKP AAIAVTGFGV AAAVTEMKFQ DGIANINTLL DDQSHLEGYK
NKIMEVSNQT GIDLKIVTDG MYQAISSIGD GGAETEKIFD TMAKSAKAGG AEVKDAVALI
SAGMKGYNQV NDETAKKISD LAFQTAKLGV TTFPEMASSM QPLFPLASNL NLSMTDLFTN
MATLTGVTGN TSEVCTQLKA VFSNLIKPTA DMQKLMEKYG FQNGQAMLKS EGLIGTLKIL
QKETGGQSDK MGKLFSSTEG LTAITALTSS QFDTLADKSK KMNEAIGTTD SALAKINNTT
GNDLRTSLNL AKNSLVGFGE VLAPFISLAA KGIGGIAKAF GGLSEGQKKA VVGIGAFVVG
SYGALAVTTK VTGTIRNAIK DYKAFRDVME KLKIATKLQT AAQKALNFVT EISPVGKIIL
VVGLVVGALT YLYNHCEWFR NGVNKIFSGL IKFFTETVPN KLKQLINFFK NDWKEILLLI
VNPFAGAFAL AYKHCEGFRN GVDKLFSNIK EFFTKSIPDF FKWLISKLTQ FKTDFINKIK
EFGTIIKTRI KNYIEEVKFI FSNLPKLMGI LIGKIAGEIY KGFLNIKIFI TKTIPDAINS
IKQWFAQLPE AIGKQLLDSY MKVKTWGNNL YISAKKTGKD FIYGVIDYVK ELPHKIWNKI
TEAYDGVTTW GNNMYQEAKK VGKEFVDSIV DHVKELPTRF KNWLKESWNK VSAWGEDLKT
AGKESGKKLV DSIVDTVKAI PGQMKEIGKN IVHGIWDGIT GAIGWIKSKI KQFCDGIVEG
FKSSLDIHSP SRVLRDQVGK FMAQGVGVGF VNEMEDINLD IKQSLDRTIN TNIVPSISNV
DLEKINTKLN NTNNNNIVVV IENVTNLDGE VISTKVYKKV AKQMKSDENS YRVTKGKKGG
RICA