Gene Plav_0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0239 
Symbol 
ID5455084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp258408 
End bp259388 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content62% 
IMG OID640875802 
ProductTPR repeat-containing protein 
Protein accessionYP_001411519 
Protein GI154250695 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGTT TTGCAGACCA AACCCCGCAA TTCACCTTCC GCGCAAAGCG CGGCGGCTGG 
CTTGCCGCGG CATCGCTCTG CTCCGCCCTC GCGCTTTCAG GATGTGCATC AACCAGCACC
ACCTCGACCG CCCAGACGCC CAGCGAAGCC GCGCAGGCTC AGATCGACAC ACCGGCGCTC
CGGAATGCAG CCATCGAAAG CACAAAGACT CAGGATTATG TCGCCGCCGC CGCGTACTGG
GGCGCGCTTT ACGAGCGCTC GCCCGACGAT GCTGTTACGA CCGTCAACTA TTCCAAGGCG
CTCCGGCAGA TAGGTTCGAT TGCACAGTCG CTCACCGTGA TGCAGCGCGC TCAGATAAAA
CATCCCGAGA ACGCGGATGT GCTCGCCGAA GCCGGCAAGG CTCTGGCCGC GAGCGGCAGG
CCGGACCAGG CGGTTGCGAT GCTGGAAACC GCCGCCCGCA AATCGCCGCA AGACTGGAGT
ATTCGCTCGG CCCTCGGCGT AGCGCTCGAT CAGACGGGCC GATACGAGGA AGCCAAGAGC
CGCTACAACG AAGCGCTCGA ACTTTCGCCC GACAACCCGT CCGTACTTAC CAACCTCGGC
CTTTCCTATG CGCTGACGGG AGATCTCGAC ATGGCCGAGC GGACACTCCG CAAGGCCGTC
GCAGATACCC GCGCCGACGC TTACGCGCGG CAAAATCTCG CCATCATTCT CGGCCTCAAG
GGAAACTTCG ATGAGGCTGA ACGGCTGGCA CGCGCCGACC TGCCTGCCAA CGTTGCAGAC
GGCAACATCG CCTATCTCCG TTCCATGCTT GCGCAACCGG CATTGTGGAA ACAGCTCGAA
GAGCTTGACC GGCAGCCTGA CACGACAGCA CCCGCACCTC AGCCAACCGG CAAACAACCT
GCCGCTGCGA AAGAAAGCCG TAACGAGAAG GAAGACCAGG TATCGTCGCT GCCGCCGGAG
ACCCGCGTTT CAATTTACTA G
 
Protein sequence
MSCFADQTPQ FTFRAKRGGW LAAASLCSAL ALSGCASTST TSTAQTPSEA AQAQIDTPAL 
RNAAIESTKT QDYVAAAAYW GALYERSPDD AVTTVNYSKA LRQIGSIAQS LTVMQRAQIK
HPENADVLAE AGKALAASGR PDQAVAMLET AARKSPQDWS IRSALGVALD QTGRYEEAKS
RYNEALELSP DNPSVLTNLG LSYALTGDLD MAERTLRKAV ADTRADAYAR QNLAIILGLK
GNFDEAERLA RADLPANVAD GNIAYLRSML AQPALWKQLE ELDRQPDTTA PAPQPTGKQP
AAAKESRNEK EDQVSSLPPE TRVSIY