Gene Plav_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2042 
Symbol 
ID5453765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2228323 
End bp2229462 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content62% 
IMG OID640877619 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001413313 
Protein GI154252489 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.419009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00181161 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATACCG GCATGGACAT GCCGTTCAAG GACGCGGAGA ACGCGGCGCT TTTCCGCATG 
CTGATTGCGA CGGCGGTGGA TGGCATCATC GTCATCGACG CCGCCGGTAA TGTGCGGCTC
TACAACCAGG CCTGCGAGCG CCTGTTCGGC TATACACCGG AGGAAGTTCT CGGCCGCAAC
ATCAAGATGC TGATGCCCTC GCCTTACCGC GACGAGCATG ACGGCTATCT CAGCGCCTAT
CACAAAACCG CCAACCGCAA GATCATCGGC ATCGGCCGCG AGGTGATGGG TCAGATGAAG
GACGGCTCGA CCTTCCCCAT GTATCTCTCC GTGGGAGAGG GCGTCTTCGA AAACCAGCGC
ATCTTCGTCG GCATCATTCA CGACACATCA GAGCGCCACT TCGCCGCCGG CCGCCTGCGG
GAAGTGCAGG AAGAGCTTGT CCACGTCTCG CGTACGAACG AAATGGGCCA GATGACCTCG
GCCCTCGCGC ATGAGCTGAA CCAGCCGCTC ACCGCCATCA TGAATTATGT CCGCGCGGCG
GCACGGACGA TCGAGGGGCT CGACGATCCG GCTGCAATCC GCGCGCGTGA ACTCATCGAC
AAGGCCGCCG AGCAGACCGG CCGCGCCGGC CAGATCATCC GGCGGCTGCG CGATTTCGTG
GAGAAACGGG AAGCCTTCCG CGCCGAGGAA GACCTGAACC AGACGATCAG GGAGGCGATT
GCGCTCGCCC TGATCGGCGC CTCGGACAGC AATGTGAAGG TCGCTGTCGA TTTCGACCCG
AACATTTCCC CTGTCATCAT GGACCGTATC CAGATACAGC AGGTCGTCGT CAATCTCGTC
CGCAACGCCG TCGAGGCAAT GCAGAGCGTC GAGCGGCGCG AAATCCTCAT AAAGACGGGA
ACGGACGAAC CCGGCTATAT CTACGCGTCC TTTGCCGATA CCGGACCAGG ACTGCCGGAA
GAGGTGGCGA GCCGGCTGTT CCAGCCCTTC GTCACGACCA AGGAGAAGGG AATGGGCATC
GGCCTCTCGA TCTGCCAGAC CATCGTGAAA GCACATGGCG GCCGCCTCTG GGCCGCGCGC
GACGGAAGCG ACGGCACCAC CTTCCGCTTC CGTCTGCCCG CAGCGGCGCA GAACCTATGA
 
Protein sequence
MDTGMDMPFK DAENAALFRM LIATAVDGII VIDAAGNVRL YNQACERLFG YTPEEVLGRN 
IKMLMPSPYR DEHDGYLSAY HKTANRKIIG IGREVMGQMK DGSTFPMYLS VGEGVFENQR
IFVGIIHDTS ERHFAAGRLR EVQEELVHVS RTNEMGQMTS ALAHELNQPL TAIMNYVRAA
ARTIEGLDDP AAIRARELID KAAEQTGRAG QIIRRLRDFV EKREAFRAEE DLNQTIREAI
ALALIGASDS NVKVAVDFDP NISPVIMDRI QIQQVVVNLV RNAVEAMQSV ERREILIKTG
TDEPGYIYAS FADTGPGLPE EVASRLFQPF VTTKEKGMGI GLSICQTIVK AHGGRLWAAR
DGSDGTTFRF RLPAAAQNL