Gene Pnap_4666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4666 
Symbol 
ID4685862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008760 
Strand
Start bp53531 
End bp54583 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID639826661 
Productvon Willebrand factor, type A 
Protein accessionYP_973824 
Protein GI121583393 
COG category[R] General function prediction only 
COG ID[COG4245] Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.404551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.0464435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGTT TGCCTGTGTT TTTCGTCCTC GACTGTTCCG AGTCCATGGT GGGTGCAAAC 
CTAAAAAAAA TGGAAGGTGC CGTCGCTGCG ATTGTCAAAT CGCTGCGCAC CGATCCGCAG
GCGCTGGAAA CTGTCTTTTT CTCAGTGATC GCATTTGCGG GTGTGGCCAG AACCATTGCG
CCGCTGGTTG AAATCGTGTC TTTCTACCCT CCGAAACTTC CTCTCGGCGG TGGCACGAAT
CTGGGATCGG CCTTGGACGC TTTGATGGGT GAAATCGACA GATCAGTGAT CAAAACGACG
GCCGAGCGCA AGGGCGACTG GCGACCCATC ATCTACTTGG TCACCGATGG CCGTCCTACC
GATAACCCGA GTCGAGCAAT TGAACGGTGG AATTCTCACT ATGCCAAAAA GGCAACGCTC
ATCGCCATAG GTCTGGGGCG TTCAGTCGAC TTTACGGCGC TGCGGCGCCT CACCGAGAAT
GTCATTTCCT TTGAAGATAT AAAGGAGAGC GACTTTAAGA AGTTCATTAA CTGGGTGACA
GCTTCCGTAG TAGTGCAAAG CAAAAGCGTC GGAGATGGAA CAGATTTTCA GGGGCTGCGC
ATTCTTGACA AGAGCGTGAT GAAAATCATC ATGGAGCCTC CTTCAACGAT TGCCGATGAA
ACTGTGGTGA CGCTGATCGG CCGGTGCCAA AAAACCAGCC GGCCCTACAT CATCAAATAC
GAGCAAGCTA TGCAAGATGT CGTCATGAAG GACTTCAAGG TCCAGGTTTC CAGGTACGAG
ATTGCCGGCT GCTATCCGCT GGAAGAGGAT TATTTTGAAT GGTCTGATCC GCGCACGGTT
GACCTCAAGG TCAACACCTC GGAGCTATTT GGAGCGCCGG GTTGCCCTCA CTGCGGCGCG
CACACGGCTT TTGCCGTCTG TGGGTGCGGA AAGCTTTTAT GCCTGAACGA TGCAGCGTCA
GTGGTTTGTC CATGGTGTCA AAAAACCGTC TCGTTCTCTT CGCTGGGGCC AGACGATGAA
GGTGGGTTCG AGGTCAGGCG AGGCAGAGGC TGA
 
Protein sequence
MRRLPVFFVL DCSESMVGAN LKKMEGAVAA IVKSLRTDPQ ALETVFFSVI AFAGVARTIA 
PLVEIVSFYP PKLPLGGGTN LGSALDALMG EIDRSVIKTT AERKGDWRPI IYLVTDGRPT
DNPSRAIERW NSHYAKKATL IAIGLGRSVD FTALRRLTEN VISFEDIKES DFKKFINWVT
ASVVVQSKSV GDGTDFQGLR ILDKSVMKII MEPPSTIADE TVVTLIGRCQ KTSRPYIIKY
EQAMQDVVMK DFKVQVSRYE IAGCYPLEED YFEWSDPRTV DLKVNTSELF GAPGCPHCGA
HTAFAVCGCG KLLCLNDAAS VVCPWCQKTV SFSSLGPDDE GGFEVRRGRG