Gene CPF_1520 details

Gene Information

Locus tag: CPF_1520
Symbol:
ID: 4201182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1736160
End bp: 1737722
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 32%
IMG OID: 638082398
Product: vanW-like family protein
Protein accession: YP_695963
Protein GI: 110800526
COG category: [V] Defense mechanisms
COG ID: [COG2720] Uncharacterized vancomycin resistance protein
TIGRFAM ID:
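
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record was produced, though they agree with the values listed. The variable names are illustrative only.

```python
# Values copied from the record above
start_bp = 1736160
end_bp = 1737722

# Assumption: coordinates are 1-based and inclusive
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is the stop codon and is not translated
protein_length = gene_length // 3 - 1

print(gene_length)     # 1563
print(protein_length)  # 520
```

Both results match the Gene Length (1563 bp) and Protein Length (520 aa) fields.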


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.516908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGAAG AAAAGAAGAA ATCAAGCACA GGCTTACTTA AAAGCAAAAA GAAAATAATA 
ATATCAATAG TTATTGTATT AGCAATTATA ATAGGTTCTA TTGTTGCATA TATAGTTAGT
ATTCAGAAAA AAGTTGAAGA GTGGAATGAT AAGATATATC CTAACGTATA TGTTGAAAAT
GTAAATTTAT CAGGAATGAC AAAGGAAAAG GCCATTGAGG TTTTAGAGAA GGATGTAAAA
GAACCTGTAG AACATAAAAC TATAAAAGTT CAGGCGGCAG ATAAAAGTAT TGAAATAAAA
TATTCTGATT TATCACCAGA ATATAATATA GATGAAACTG TTAATGAAGC TATGAATTAT
GGAAAAGATT TAAATCTTTT TGAGAAAAAT AACCTTATAA ATGGAAAAGA TAAAAAGGAA
TTAAATTTAG ATTTTAAATA TGATGAATCT AAGTTAACAG ATTATGAGAA AAAACTTACT
GAAATGGTAA ATCAAAATGC TAAAAATGCT ACCATAAGTA TAAATGGTAG TAATATAAGT
GTAATAGAAG GCGAAGATGG AAGAGCCATA GAAGAAGATA AAATGGTTTC TTTAGTAAAA
GAAGCTATAA ATGCAAATCC AGAGGATAAT TCAGTTGTGG AAGTACCTGT AGAGGTTACA
AAACCAAAAA TAACTAAGGA AATGCTTTCA AAAATAGACG GCGTTATAGG AAGTTTTACA
ACAAGTTATA CAAGCTCAGA TGCTAATAGA AGTGCTAATG TTGAAATTGC AGCTAAAACG
GTTAATGGAA CTATTTTAAT GCCAGGAGAT ACATTTAGTT ATAATAATAC TTTAGGGGAA
AGAACCACAG CTAAGGGATA TAGAGATGGA GCGGCTTACG TAGGAAATAA AGTAGTAATG
GTTACTGGTG GAGGAATCTG TCAAGTTTCT ACAACATTAT ACAGAGCTGT TTTAAGAGCT
GGAATAATGC CAACAGAGAG ACATAATCAT AGTATGACAA CTACTTATTC AGGCCCAAGT
GAAGATGCTA CAGTTTCATG GGGATCTTTA GACTATCAAT TTAAAAACCC TTATGATTTC
CCAATATATA TACAAGGATA TACAAGTAAT AAACATGTAA CATTTAATAT ATATGGAAAT
GTACAAGGTA TGGATGGAAA AACTTATGAA TTACAAACTG TAGTAAATGA AACTCTAAAA
CCATCAGTTA AAACAGTTGA TGATCCTAAT TTGCCAGAGG GACAAAAAGT TGTTGAGCAA
AGACCAGTTA CAGGATATAA GTCATCAGGA TATTTAGTAA CTTATCAAAA TGGAAAAGAA
ATAGATAAGA AATTAATAGG ACATGATGTA TATAAACAAA AGGATGAAAT TATAAAGGTT
GGAACAAAAA AAGCTGAGCA ACCAAAGCAA GAAGCACCAA AACAAGAACA GCCAGCTACT
GCAAAGCCAG AGGAGCCAAA ACAAGAGGCA ACTCAGCCAT CAACATCACA ACCTGCTACA
AATCAAGCAC CTGATGCAAC ACCTCAAACA CCTAATGCAG GGCAAACACC ACCAGCGCAG
TAA
 
Protein sequence
MKEEKKKSST GLLKSKKKII ISIVIVLAII IGSIVAYIVS IQKKVEEWND KIYPNVYVEN 
VNLSGMTKEK AIEVLEKDVK EPVEHKTIKV QAADKSIEIK YSDLSPEYNI DETVNEAMNY
GKDLNLFEKN NLINGKDKKE LNLDFKYDES KLTDYEKKLT EMVNQNAKNA TISINGSNIS
VIEGEDGRAI EEDKMVSLVK EAINANPEDN SVVEVPVEVT KPKITKEMLS KIDGVIGSFT
TSYTSSDANR SANVEIAAKT VNGTILMPGD TFSYNNTLGE RTTAKGYRDG AAYVGNKVVM
VTGGGICQVS TTLYRAVLRA GIMPTERHNH SMTTTYSGPS EDATVSWGSL DYQFKNPYDF
PIYIQGYTSN KHVTFNIYGN VQGMDGKTYE LQTVVNETLK PSVKTVDDPN LPEGQKVVEQ
RPVTGYKSSG YLVTYQNGKE IDKKLIGHDV YKQKDEIIKV GTKKAEQPKQ EAPKQEQPAT
AKPEEPKQEA TQPSTSQPAT NQAPDATPQT PNAGQTPPAQ
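
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that cleanup and a GC-content calculation could be done. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any tool associated with this record, and the exact method behind the record's GC content field is not stated in the source.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small example
first_line = "GTGAAAGAAG AAAAGAAGAA ATCAAGCACA GGCTTACTTA AAAGCAAAAA GAAAATAATA"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence (all lines joined) to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.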