Gene Plav_3223 details

Gene Information

Locus tag: Plav_3223
Symbol:
ID: 5453987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3441050
End bp: 3442228
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 60%
IMG OID: 640878812
Product: NADH dehydrogenase I, D subunit
Protein accession: YP_001414486
Protein GI: 154253662
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
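
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the unspliced CDS is assumed to end in a single stop codon. Both readings are assumptions, as the record does not state them. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3441050, 3442228)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```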


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0966966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAAC AGGAACTCCG CAATTATCAC CTGAATTTCG GTCCGCAGCA TCCGGCCGCG 
CATGGTGTGC TGCGCCTCGT GCTGGAGCTC GATGGCGAAG TCGTCGAGCG CGTGGACCCG
CATATCGGTC TTCTGCACCG TGGCACGGAA AAGCTGATCG AGTACAAGAC CTATCTTCAG
GCGACCCCAT ACTTCGACCG GCTCGATTAC GTTGCGCCGA TGAACCAGGA GCACGCCTTC
GTGCTTGCCG CCGAGCGTTT GCTTGGCCTC GAGGTGCCGC GCCGCGCCCA GTTCATCCGC
GTCCTCTATT CCGAGATCGG CCGCATCCTC GCGCATTTGC TCAACGTAAC GACGCAGGCG
ATGGACGTCG GCGCGCTCAC CCCGCCGCTT TGGGGCTTTG AAGAACGCGA AAAGCTCATG
ATCTTTTATG AGCGTGCGTC CGGAGCCCGC CTTCACGCAA ATTATTTCCG CACGGGCGGT
GTGCATCGCG ACCTGCCGCC AAAGCTGCTC GAGGATATCT ACAATTTTTG CGACCCGTGT
TCGCAGGTGC TGGACGATCT CGAAGGTCTC ATCACCGACA ACCGCATCTT CAAGCAGCGC
AACGTCGACA TCGGCGTCGT CTCGCAGGAA GAGGCGCTGG AGTGGGGCTT CTCCGGCGTC
ATGGTGCGCG GCTCAGGCAT GGCCTGGGAC CTGCGCCGCG CGCAGCCCTA TGAGGTTTAT
TCCGAACTCG ATTTCGACAT TCCCGTAGGC AAGAACGGCG ACTGCTACGA TCGCTATCTC
TGCCGCATGG AAGAAATGCG CCAGTCCTTG CGCATCATGA AGCAGTGCAT CGAGTTGATG
CCGGGTGGTC CTGTGCATGT GCTCGATGGC AAGGTCGTGC CGCCGTCGCG CAGCGAGATG
AAGCGCTCGA TGGAAGCGCT TATTCATCAC TTCAAGCTTT ATACCGAGGG CTACCACGTG
CCCGCCGGCG AGGTTTATGC CGCCGTCGAA GCGCCCAAGG GCGAGTTCGG CGTCTACCTC
GTGTCGGATG GTGGTAATAA GCCTTACAAG TGCAAGATCC GTGCTCCCGG CTACGCGCAT
CTTCAGGCCA TGGACCATCT CTGCAAGGGT CACATGCTTG CGGACGTATC GGCCATTCTC
GGTTCTATCG ACATCGTTTT CGGAGAGGTG GACCGGTGA
 
Protein sequence
MAEQELRNYH LNFGPQHPAA HGVLRLVLEL DGEVVERVDP HIGLLHRGTE KLIEYKTYLQ 
ATPYFDRLDY VAPMNQEHAF VLAAERLLGL EVPRRAQFIR VLYSEIGRIL AHLLNVTTQA
MDVGALTPPL WGFEEREKLM IFYERASGAR LHANYFRTGG VHRDLPPKLL EDIYNFCDPC
SQVLDDLEGL ITDNRIFKQR NVDIGVVSQE EALEWGFSGV MVRGSGMAWD LRRAQPYEVY
SELDFDIPVG KNGDCYDRYL CRMEEMRQSL RIMKQCIELM PGGPVHVLDG KVVPPSRSEM
KRSMEALIHH FKLYTEGYHV PAGEVYAAVE APKGEFGVYL VSDGGNKPYK CKIRAPGYAH
LQAMDHLCKG HMLADVSAIL GSIDIVFGEV DR
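
The gene sequence can be translated and compared with the listed protein sequence as a consistency check. The sketch below is an illustration under stated assumptions, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for elongation codons; it ignores whitespace, since the page groups bases in blocks of ten; and it stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 differs from the standard table only in its
# permitted start codons, so the same mapping applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last codons of the gene sequence shown above.
print(translate("ATGGCCGAAC AGGAACTC"))  # MAEQEL
print(translate("GACCGGTGA"))            # DR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would extend the same check to all 392 residues.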