Gene Plav_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2024 
Symbol 
ID5456808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2211244 
End bp2212737 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content62% 
IMG OID640877601 
Productsignal transduction histidine kinase 
Protein accessionYP_001413295 
Protein GI154252471 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.149212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.406004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGC TGGGTCTGTC CATGCGCGTT TTCGGCGCCA GAGACGCCGA TACGGACGAT 
ATGAGGCGGG TGAACGCGGC GTTGATGACG CTCGCGTTCC TGCTGCTGTT CGGCGCCAGC
GCGGCCGCGC TGATGCTGGC CGACCGCGCG CGGACGAATG CCGAAGCGGT ATCTCAAACA
ATCGAAGTGC GCGCCGACCT CCTTACCATC CTCAATCTGC TTCAGGGCAC GGAAACCGGC
CAGCGCGGCT TTCTGCTGAC ACGCGATTCG AAGTATCTGG AGCCCTACAA CGAGACGCGG
TTCGAGATCG GCGAGAAGCT GACAGCACTT GAGGCGCGGG TTGCCGATAA TCCTGGTCAG
CAGAAGCGCA TCGAGGCGCT CCACGACACA GTCCGCCAGC GCATTGCCAT CCTCGACGGC
ACGATCGCCA TGGCGCGTGA AGGGGACTTC GAGGGCGCGA GCAAAATCGT CGGGAATGAC
CGCGGCAAGG CGCTGATGGA TGAGTCCCGC AGGCTGGTGG ATGAGGCGAT TTCCGAAGAG
AACGCGCTTC TCGCTGCACG CCAGGAAAGC GCGACCCGCT CGCATACATG GCTGTTCACG
GCGCTTGTCG GCGCGCTCGT CGCCTCATTC ATCCTTGCTT TCCTGGCGCT GCAACTGACC
CGCCGCCAGT TCATGAACAT GAAAACGAGG CGCGACCAGC TACTGCGGCT GAACGAAGAA
CTGGAGAGGC GCGTCGCCGA ACGGACCGCC GACCTTGAAA GGGCCCGCGA GCTTGCCGAA
AGCGAAGCAG GCCGCGCCGA ATATGAACGG GGACGCGTGG AGCTTCTGTT GCGCGACGTC
ACGCATCGTG TCGGCAACAA CCTCGCTATG GTTTCATCGC TGTTGCGCAT GCAGCAGTCG
AAAGTCAATG ACGGGGAAGC CCGTTCGGCA CTCGAAACCG CGCGCGGTCG CATTCAGACG
ATCAGCACCG CGCAACGCCG CCTCCGGCTG GGGGCCGACC TGCAATCCAC CCGCGCCGAT
GAATTGCTTG AAGCCGTGGT TTCCGACCTT GCCGATTCGA CGCTGGAGAA CGGCACGCTC
ACCATTGTGA GCGACTTTCA GCCGCTCATT GTAGCCTCAC GGGACACCAC GACACTTGCG
GTGGTGCTGG GTGAGCTAGT ATCAAACGCC ATCAAGCACG CCTTTCAAGG CCGAAGCAGC
GGCGAGATCA AAGCCTCGTT CACGCTCGGC CCCGATGGCA TTCCCCTACT TGCCGTCATC
GACGACGGGA TCGGGATGGA GGCGGCGACG TCCGGCAAAC CGGAGCACCC CGGCCTCGGC
ACCACGATTA TCGATAACCT CTCGCGCCAG TATGGCGGCG AAATCAAGAG ACACACCAAC
GAAGCCGGCG GCACATCGAT CTTCATCACG TTGCCCAAGC TTCAAGTCAA ACACCCGGAC
CCGGTTTCCG AACCTCAGAA TTCTATCTCC GAGACCAAAG GCACCGAACA ATGA
 
Protein sequence
MTGLGLSMRV FGARDADTDD MRRVNAALMT LAFLLLFGAS AAALMLADRA RTNAEAVSQT 
IEVRADLLTI LNLLQGTETG QRGFLLTRDS KYLEPYNETR FEIGEKLTAL EARVADNPGQ
QKRIEALHDT VRQRIAILDG TIAMAREGDF EGASKIVGND RGKALMDESR RLVDEAISEE
NALLAARQES ATRSHTWLFT ALVGALVASF ILAFLALQLT RRQFMNMKTR RDQLLRLNEE
LERRVAERTA DLERARELAE SEAGRAEYER GRVELLLRDV THRVGNNLAM VSSLLRMQQS
KVNDGEARSA LETARGRIQT ISTAQRRLRL GADLQSTRAD ELLEAVVSDL ADSTLENGTL
TIVSDFQPLI VASRDTTTLA VVLGELVSNA IKHAFQGRSS GEIKASFTLG PDGIPLLAVI
DDGIGMEAAT SGKPEHPGLG TTIIDNLSRQ YGGEIKRHTN EAGGTSIFIT LPKLQVKHPD
PVSEPQNSIS ETKGTEQ