Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_1938 |
Symbol | |
ID | 5454753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 2112189 |
End bp | 2114183 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640877515 |
Product | Beta-galactosidase |
Protein accession | YP_001413210 |
Protein GI | 154252386 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.949653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.422519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAAG CTCAAAGCCT CACCGACGTC TCGATCGGCG TCTGCTATTA CCCGGAGCAC TGGCCGGAAG CCATGTGGCC GGAGGATGCC CGGCGCATGC GTGAGGCGGG GATTTCCCGC GTCCGCATCG GCGAGTTCGC CTGGTCCCGG CTGGAGCCGG AGCCGGGAAC TTACGATTTC GACTGGCTAT CCCGCGCGCT CGATACGCTG CATGGCGAGG GCCTCCAGGT CGTTCTCGGG ACGCCGACGG CAACGCCCCC CAAATGGCTG GTCGATTCGA TGCCCGACAT GTTGCCGGTA GACCGGCATG GGCGGCCCCG CGGCTTTGCG TCGCGCCGCC ACTACTGCTT CTCCCATGCC GGATACCGGC GCGAATGTGC GCGTATCGTG AAGGCGCTGG CTGAGACATT CGGGCAGCAT CCGGCCATCG TCGCCTGGCA GACCGACAAT GAATATGGCT GCCACGACAC GGTGCTTTCC TATTCCGACG AGGCGCGGCT CGGCTTTCGT CATTGGCTGC GCGATCGCTA CGGATCGGTC GCCGCGCTCA ACGATGCCTG GGGCAATGTC TTCTGGAGCA TGGAATATCG CAGCTTCGAT GAGATCGACC TGCCTTCCGG CACGGTCACG GAGGCGAACC CGGCACATCG CGCCGATTTT CATCGCTATT CGTCGGATCA GGTCGCGGCG TTCAATCGCG TGCAGGTCGA AATCATCCGC GCCCTTTCGC CCGGCCGCGA CATTCTCCAC AATTTCATGA CCTTCTTTCT CGACTTCGAT CACTACGAGG TGATGAAGGA CCTCGATATT GCAACCTGGG ATTCCTATCC GCTGGGAAGT CTCGACGTCT TTCCGGGCGA CGCCGCTCAC AAGACCGCCT TTATGCGCAC GGGCGATCCC GATCTTCAGG CATTCCACCA CGATCTTTAT CGGGGTGCCG GCCGTGGCCG CTTCTGGGTC ATGGAGCAGC AGCCCGGCCC GGTGAACTGG GCGCATTACA ACGTGGATGC GCAGCCCGGT CTCGTCCGTC TCTGGGGGCT TGAGGGCTTC GCGCATGGTG CGGAGACGAT TTCTTACTTC CGCTGGCGGC AGGCGCCTTT CGCGCAGGAG CAGTTTCATG CCGGGCTCAA CCTGCCGGAT GGCGAACCGG ACCGCGCCTT CCATGAAGTG AAGCAGCTCT CGCAGGATCT CGCCTCGCTC GGTCCGCTCG GTGCGTCCGC TTCCGCACGG GTGGCGCTCG TCTTTTCCTA TGAGGCGGCA TGGTTCCTCC GCGTGCAGCC GCAAGGGCGC AGCTTCTCCT ATGTGGAGCA GGTCTTTGCG ATGTACCGCT CGCTCCGCCG TCTCGGGCTC GATGTCGATG TCGTCGGGCC CGATGCCGAA GTGAGCGGTT ACGCGCTTGT CCTCGTTCCT TCCATGCCGC ATGTGCCGGA GCGTCTCGCC GCGTCGCTCG CGGCTTGCAA GGGAACGCTC CTCATTGCCG CACGCAGCGG CAGCCGCACA GCGTCTCACC GGATACCGGA CAATCTCGCT CCCGGCATCC TCTCCGGCCT GCTCGGCGTA AAGGTCACGC GCGCCGAGAG TTTCCGGCCC CACGGCGCCG TTCCCGTCCG CTACGACAAT GAAAACTACA GCTTCGACCG CTGGCGCGAA TTCGTCGTTC CGGAGGCGGG CGTCGAGGTG CTGGCGGAGA CGGAAGACGG GCATCCGGCC TTCACCCGCA AGGGCCGCGC GCATTACCTC GCCGGCTGGC CGGACGACGC ATTCCTCAAC GGGGTAGTGG AGCGGCTGGC GCGGGAGGCG GGCCTCGCAA CGGGCGAGCT TCCAGCCGGC TTGCGCAGCC GCCGGCGTGG GCCTTACCGC TTCGTTTTCA ACTACGGCCC GGCCGCCGCC GACATCTCGC CTTATTTCCC CGCTAGTGAG TTTGTGCTCG GCCAACCCCG GCTTGAAGTC GGCGGTGTGG CGGTTCTGCA TACGGACGTT CCAGCGAACG GTTAA
|
Protein sequence | MPQAQSLTDV SIGVCYYPEH WPEAMWPEDA RRMREAGISR VRIGEFAWSR LEPEPGTYDF DWLSRALDTL HGEGLQVVLG TPTATPPKWL VDSMPDMLPV DRHGRPRGFA SRRHYCFSHA GYRRECARIV KALAETFGQH PAIVAWQTDN EYGCHDTVLS YSDEARLGFR HWLRDRYGSV AALNDAWGNV FWSMEYRSFD EIDLPSGTVT EANPAHRADF HRYSSDQVAA FNRVQVEIIR ALSPGRDILH NFMTFFLDFD HYEVMKDLDI ATWDSYPLGS LDVFPGDAAH KTAFMRTGDP DLQAFHHDLY RGAGRGRFWV MEQQPGPVNW AHYNVDAQPG LVRLWGLEGF AHGAETISYF RWRQAPFAQE QFHAGLNLPD GEPDRAFHEV KQLSQDLASL GPLGASASAR VALVFSYEAA WFLRVQPQGR SFSYVEQVFA MYRSLRRLGL DVDVVGPDAE VSGYALVLVP SMPHVPERLA ASLAACKGTL LIAARSGSRT ASHRIPDNLA PGILSGLLGV KVTRAESFRP HGAVPVRYDN ENYSFDRWRE FVVPEAGVEV LAETEDGHPA FTRKGRAHYL AGWPDDAFLN GVVERLAREA GLATGELPAG LRSRRRGPYR FVFNYGPAAA DISPYFPASE FVLGQPRLEV GGVAVLHTDV PANG
|
| |