Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3381 |
Symbol | |
ID | 5453536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3620287 |
End bp | 3621537 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640878971 |
Product | hypothetical protein |
Protein accession | YP_001414642 |
Protein GI | 154253818 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.101436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTGG ACGACGCTGC ACCACGAGCA CCACGCCAAG GCCCGGTCGC CCTCGCTGAT CTGTTCGACA GCGCGCTGAA AGACCTTGCA CCGAAACCTC GCCCGCCCGC GTCCGCGCCA GCTCCCGCAT CCGCAACATC TCCTGTCTCC GCAACCGGCG ACGGTTTTCT GTTCAGCGGC AATCGGCACG AGAGCGTGCC GCGACGGCTG TTCCTGGACC GACGCCTGAC GCCGCTGGAA CGCAATGCCT GGCAAGTCTT TCGGATGATG CTCAACGAGG ATGGGGTCAC GGCGTTTCCG ACGTATGAGC AACTTCGCCC GTGGCTCGCG TCAATGCCCT GCGCCGGACA GGCCTCTCAC GAGACCGTGG CACGGGCGCT GACGCTGTTG CGCCTGACTC GCTGGTTGAG TCTCGTGCGC CGCAGGCGTG ACCCCAAGAC CGGGCGCATC CTCGGCAACC TCTACGTCCT GCACGATGAG CCCCTGACGC CTTTCGAGGC GATGCAGCTC GACCCGGACT ACTTGGAGCT CGTCAGCCAA GCCCTGGGCC ATTCGGCCAA GGCCGTCCAG GTCGTGGGCT TGCACACCCT CAAGGAAATC GCCGAAGACC CGTTGTTGTC TGGCCGCACG TTGCCCTCGC GGCTGCAGGT CCTTGCCGAA CGCCTCGCGA GCCAGGGCAT TGGGTCGCAG GAGAGTTATC CACAGAAGGA TAGGGTTCAC GATTCCGAAG AAGGGGCGCC GAGCCTTCTT CGGAATTCTG ATGACCCCTC TTCGGATTCC GAAGCAGGGC CGAAACCCGC GTCAGACGGC GCTCTTCGGA ATCCGAAGCA GGACCGTACT GTACGTAGTA GTCGTATGAA TGAAGTACGT ACTACCGCGC GTGAGCGTGG GCAGGCGCGT GCGATGCCAG GAGTTCGCCT GCCTGATCGC TTTCTCGGCT TGAAGGAAGA GCAGCAGGCC GCCGCCATCG TGGCATTGCA GCAGGTCGAT GCTCCGCTAC GCCAGGCCGT GCTGGACGAA TGGGCGGATC GCTGTCGCGG CAGCACCATC CGCAACCCGG CAGGCTACCT GTTCGGCATC ATCCAGCGCG CCATCCGTGG CGAGTTCAAC GCCTGGGCCA AGCAAGCCGG GTCAGCACCG CCACCTGCCC CTGCACGAGA TGCACCGCCT GAACCACCGC GCAATGTGGT TCCACCCGAG GTGGCCCGGC AGCACATTGA CCGGCTGCGC GACCTTCTGC GCAGTAGCTG A
|
Protein sequence | MAVDDAAPRA PRQGPVALAD LFDSALKDLA PKPRPPASAP APASATSPVS ATGDGFLFSG NRHESVPRRL FLDRRLTPLE RNAWQVFRMM LNEDGVTAFP TYEQLRPWLA SMPCAGQASH ETVARALTLL RLTRWLSLVR RRRDPKTGRI LGNLYVLHDE PLTPFEAMQL DPDYLELVSQ ALGHSAKAVQ VVGLHTLKEI AEDPLLSGRT LPSRLQVLAE RLASQGIGSQ ESYPQKDRVH DSEEGAPSLL RNSDDPSSDS EAGPKPASDG ALRNPKQDRT VRSSRMNEVR TTARERGQAR AMPGVRLPDR FLGLKEEQQA AAIVALQQVD APLRQAVLDE WADRCRGSTI RNPAGYLFGI IQRAIRGEFN AWAKQAGSAP PPAPARDAPP EPPRNVVPPE VARQHIDRLR DLLRSS
|
| |