Gene Plav_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1938 
Symbol 
ID5454753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2112189 
End bp2114183 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content65% 
IMG OID640877515 
ProductBeta-galactosidase 
Protein accessionYP_001413210 
Protein GI154252386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.949653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.422519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAAG CTCAAAGCCT CACCGACGTC TCGATCGGCG TCTGCTATTA CCCGGAGCAC 
TGGCCGGAAG CCATGTGGCC GGAGGATGCC CGGCGCATGC GTGAGGCGGG GATTTCCCGC
GTCCGCATCG GCGAGTTCGC CTGGTCCCGG CTGGAGCCGG AGCCGGGAAC TTACGATTTC
GACTGGCTAT CCCGCGCGCT CGATACGCTG CATGGCGAGG GCCTCCAGGT CGTTCTCGGG
ACGCCGACGG CAACGCCCCC CAAATGGCTG GTCGATTCGA TGCCCGACAT GTTGCCGGTA
GACCGGCATG GGCGGCCCCG CGGCTTTGCG TCGCGCCGCC ACTACTGCTT CTCCCATGCC
GGATACCGGC GCGAATGTGC GCGTATCGTG AAGGCGCTGG CTGAGACATT CGGGCAGCAT
CCGGCCATCG TCGCCTGGCA GACCGACAAT GAATATGGCT GCCACGACAC GGTGCTTTCC
TATTCCGACG AGGCGCGGCT CGGCTTTCGT CATTGGCTGC GCGATCGCTA CGGATCGGTC
GCCGCGCTCA ACGATGCCTG GGGCAATGTC TTCTGGAGCA TGGAATATCG CAGCTTCGAT
GAGATCGACC TGCCTTCCGG CACGGTCACG GAGGCGAACC CGGCACATCG CGCCGATTTT
CATCGCTATT CGTCGGATCA GGTCGCGGCG TTCAATCGCG TGCAGGTCGA AATCATCCGC
GCCCTTTCGC CCGGCCGCGA CATTCTCCAC AATTTCATGA CCTTCTTTCT CGACTTCGAT
CACTACGAGG TGATGAAGGA CCTCGATATT GCAACCTGGG ATTCCTATCC GCTGGGAAGT
CTCGACGTCT TTCCGGGCGA CGCCGCTCAC AAGACCGCCT TTATGCGCAC GGGCGATCCC
GATCTTCAGG CATTCCACCA CGATCTTTAT CGGGGTGCCG GCCGTGGCCG CTTCTGGGTC
ATGGAGCAGC AGCCCGGCCC GGTGAACTGG GCGCATTACA ACGTGGATGC GCAGCCCGGT
CTCGTCCGTC TCTGGGGGCT TGAGGGCTTC GCGCATGGTG CGGAGACGAT TTCTTACTTC
CGCTGGCGGC AGGCGCCTTT CGCGCAGGAG CAGTTTCATG CCGGGCTCAA CCTGCCGGAT
GGCGAACCGG ACCGCGCCTT CCATGAAGTG AAGCAGCTCT CGCAGGATCT CGCCTCGCTC
GGTCCGCTCG GTGCGTCCGC TTCCGCACGG GTGGCGCTCG TCTTTTCCTA TGAGGCGGCA
TGGTTCCTCC GCGTGCAGCC GCAAGGGCGC AGCTTCTCCT ATGTGGAGCA GGTCTTTGCG
ATGTACCGCT CGCTCCGCCG TCTCGGGCTC GATGTCGATG TCGTCGGGCC CGATGCCGAA
GTGAGCGGTT ACGCGCTTGT CCTCGTTCCT TCCATGCCGC ATGTGCCGGA GCGTCTCGCC
GCGTCGCTCG CGGCTTGCAA GGGAACGCTC CTCATTGCCG CACGCAGCGG CAGCCGCACA
GCGTCTCACC GGATACCGGA CAATCTCGCT CCCGGCATCC TCTCCGGCCT GCTCGGCGTA
AAGGTCACGC GCGCCGAGAG TTTCCGGCCC CACGGCGCCG TTCCCGTCCG CTACGACAAT
GAAAACTACA GCTTCGACCG CTGGCGCGAA TTCGTCGTTC CGGAGGCGGG CGTCGAGGTG
CTGGCGGAGA CGGAAGACGG GCATCCGGCC TTCACCCGCA AGGGCCGCGC GCATTACCTC
GCCGGCTGGC CGGACGACGC ATTCCTCAAC GGGGTAGTGG AGCGGCTGGC GCGGGAGGCG
GGCCTCGCAA CGGGCGAGCT TCCAGCCGGC TTGCGCAGCC GCCGGCGTGG GCCTTACCGC
TTCGTTTTCA ACTACGGCCC GGCCGCCGCC GACATCTCGC CTTATTTCCC CGCTAGTGAG
TTTGTGCTCG GCCAACCCCG GCTTGAAGTC GGCGGTGTGG CGGTTCTGCA TACGGACGTT
CCAGCGAACG GTTAA
 
Protein sequence
MPQAQSLTDV SIGVCYYPEH WPEAMWPEDA RRMREAGISR VRIGEFAWSR LEPEPGTYDF 
DWLSRALDTL HGEGLQVVLG TPTATPPKWL VDSMPDMLPV DRHGRPRGFA SRRHYCFSHA
GYRRECARIV KALAETFGQH PAIVAWQTDN EYGCHDTVLS YSDEARLGFR HWLRDRYGSV
AALNDAWGNV FWSMEYRSFD EIDLPSGTVT EANPAHRADF HRYSSDQVAA FNRVQVEIIR
ALSPGRDILH NFMTFFLDFD HYEVMKDLDI ATWDSYPLGS LDVFPGDAAH KTAFMRTGDP
DLQAFHHDLY RGAGRGRFWV MEQQPGPVNW AHYNVDAQPG LVRLWGLEGF AHGAETISYF
RWRQAPFAQE QFHAGLNLPD GEPDRAFHEV KQLSQDLASL GPLGASASAR VALVFSYEAA
WFLRVQPQGR SFSYVEQVFA MYRSLRRLGL DVDVVGPDAE VSGYALVLVP SMPHVPERLA
ASLAACKGTL LIAARSGSRT ASHRIPDNLA PGILSGLLGV KVTRAESFRP HGAVPVRYDN
ENYSFDRWRE FVVPEAGVEV LAETEDGHPA FTRKGRAHYL AGWPDDAFLN GVVERLAREA
GLATGELPAG LRSRRRGPYR FVFNYGPAAA DISPYFPASE FVLGQPRLEV GGVAVLHTDV
PANG