Gene Plav_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2160 
Symbol 
ID5454755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2341247 
End bp2342596 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content64% 
IMG OID640877737 
ProductHipA domain-containing protein 
Protein accessionYP_001413431 
Protein GI154252607 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCG CACAGCCCGA CATTCTGGAC GTGCGCTTGG AAGCCGCCGA CATTCCACTC 
GGGCATCTGG CGCGCAAGGA TGGCGGCTGC CGCTTCGCTT ATACGCCAGA CTATCTCGCG
CGCACCGACG CCATGCCGCT CTCGTTGTCC CTGCCGCTGC GCGAGGAACC TTACGGCGAT
GTCGAGTCAC GCGCCTTTTT CGACAACCTG CTACAGGAAA ACGATCAGCT TCAGCAGACG
ATGGACCGCG AACGTATCGC CCGCGACGAC ATTGTCGGCC TGCTGAGTTT TGTCGGCGGC
GACTGCGCGG GCGCGATCAG TTGCCTGCCG CCGGGAAGCG GCCCGGTCAA GGTTCCGGGT
AACCTCGCCA CGGATTACGA CGAACTGCCG CGCGAGGAAC TCATCGATAT CATGCGCCGC
CTCGCAGACA AGTTGCCCTT ACCGGACGCC ATAAAAGACC CGTCGCCGGT TGCAGGCGTG
CAGCGCAAGA TCGCGCTGAC AGAAATCGCG AAGGGCCGTT TCGGCCTGCC GAAGCCCGGC
CTGCGCGTGC CGACGACGCA TATATTGAAG GTGCCGGAGC GCCGACTGGC ACGCGAAGTG
TTGCTGGAAG CCGTTGCCAC GCGACTGGCG CATGCCGTGG GGCTCGAGGT CGCCATACCG
GCGGAATTCA AACTCGACGA CGAAGCCGGC TTGTTGAGCT TGCGCTTCGA CCGCCGCATC
GACGTCAATA GCGTGACGCG CATCCACCAG GAAGATTTCT GTCAGGCATT GGGCCTGCCT
GCTCGCCTCA AATATGAGCG CAACGGAACG GCAGAATTGC GCTTCGCGGC CACCGCCGTT
TCAGACCTGC TCGGCCGCAC CGCCGCGCCG GCAAGAGACA AGCGCAGCTT CATGGTGTCG
GCGTTTTTCA ATCTCGCGAT CGGCAATAAC GACAATCACG CGAAGAACCA TGCGCTGCTT
TACGACACGG GGCCTATACC GCGCCTTGCG CCGCTTTACG ACCTGCAACC CGTGCGGCTG
ACCGGGCGCT ATACCGACGA GCTGGCCTTC AGGATCGGCG CCGCCGACCG CTTCGACACG
GTGACGGCGG GCGACATCGC CGCTTTCATG GCGGCTTTCG GGCCGGGCAC GGCGACGGCG
CAGCGGCGCT TCATCGAGGG TGAAGTCGGG CCGATGCTGG CGGAGCTGGA CAGGCAGACG
GCTGTGCTGC CTTCGCATGG ATTGAAGGAT TTCGACGATC TGATCGGCCG CGAAATCTCG
CAGCTTGCGG ACCTGCTTGG GCTCAAGCTG GCGACCAGGG AGCGGGACTA TTTCACGGAA
AAGGGCGGCG GCTGGGGACA ATTGAGCTGA
 
Protein sequence
MSAAQPDILD VRLEAADIPL GHLARKDGGC RFAYTPDYLA RTDAMPLSLS LPLREEPYGD 
VESRAFFDNL LQENDQLQQT MDRERIARDD IVGLLSFVGG DCAGAISCLP PGSGPVKVPG
NLATDYDELP REELIDIMRR LADKLPLPDA IKDPSPVAGV QRKIALTEIA KGRFGLPKPG
LRVPTTHILK VPERRLAREV LLEAVATRLA HAVGLEVAIP AEFKLDDEAG LLSLRFDRRI
DVNSVTRIHQ EDFCQALGLP ARLKYERNGT AELRFAATAV SDLLGRTAAP ARDKRSFMVS
AFFNLAIGNN DNHAKNHALL YDTGPIPRLA PLYDLQPVRL TGRYTDELAF RIGAADRFDT
VTAGDIAAFM AAFGPGTATA QRRFIEGEVG PMLAELDRQT AVLPSHGLKD FDDLIGREIS
QLADLLGLKL ATRERDYFTE KGGGWGQLS