Gene Information |
Locus tag | Plav_1321 |
Symbol | |
ID | 5454114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 1456888 |
End bp | 1458777 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640876892 |
Product | heparinase II/III family protein |
Protein accession | YP_001412598 |
Protein GI | 154251774 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
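The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship for this record. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1890 bp) and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1456888
END_BP = 1458777


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1890 629
```

Both results agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields in the record.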
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.213971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000000859361 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGAGA CGCGGACCGG ACGGGATGGA AGAGGAGCGG GGCCGCGCAC GCGCCCGGCC CCGGGCAACC GCGCCATCGA CCTTGCGGGC GCCGCCCTTA TCCGCACCCG CGACGCCGCC CTCGCCCGCA TCTTCAGCAC CTGGGCCTAT GGCCAGACGC TGCACGGGCA GATGCCCGAC CACATGGTGC TTTATCCGCC GGAACTGCGG CCGGGCCGCG CCTCCGCCGC CGACGCCTTG TTTCAGGGCC GCTGGTTCCT GCCGGGCGGA CAAGTGCGCG CGCCGGGCGG CTCCTCGCCC TTTGCCGCCG ATCCGCCGAG CGAGAGATGG GCGGAGGAAC TGCACGGCTT TTCCTGGCTG CGCCATTTCA CCACCGCGCA TGAAGCGAAC AATGGCGACG CCGCACGCCA ATATGCGCAG AAGCTCGTCG CCGACTGGAT CGCGACCGAG GGCAACTGGC ATCCCGTCGC GTGGCGGCCG CAGGTCATCG GAAGGCGGCT CATCTCATGG GTCGCGAACG GCGCGCTCAT CATCGACGGC ACCGAACTGG TCTATCGCTC GACGCTCCTG CGCAACATGG CGCGGCAGGC ACGCCACCTC GCCCGCGTCG CCGCCATTGC CCCGGCGGGC GAACCGCGCA TCACGGCCGC GATGGGTCTT GCCTTTTCCG GCCTCTGCCT GTCGGAAGGC CACAAGCGGC TGACAAAAGG CGTGCAGCTT CTCTGCCGCG AACTCGACCG GCAGATCCTC GCCGATGGCG GGCATGTGAG CCGCAACCCT TCGGCGCAGC TTTCGATCCT TCTCGATCTG CTGTCGCTGC GCGATGCGCT GACCGCGCGC AACATCGAGG TGCCGAAGCC GGTGCGCGAC GCCATCGACC GGATGACGCC GATGCTGCGC TTCTTCCGCC ATGGCGACGG CAGGCTGGCA CTCTTCAACG GCTCGACCGA GGGGCCGGAA GGCGCGGTCG ATGCCGCGCT CGACCGCGAC GACACGAAGG GGAAACCCTT CGGCTTCGCG CCGCATTCCG GCTATCAGCG CCTCGCCGCG GGAAACGTGA ACCTGATCGC CGATACCGGC ACCGCGCCGC CCGGCGCCTA TAGCCACGAG GCACATGCAG GCTGCCTTTC ATTCGAGATG AGCGTCGGCC GCAACCGCAT GATCGTGAAT TGCGGCGCGA CGAAAGTGCT CGGGCCCGAC TGGGAGGCGG CGAGCCGCGC CACCGCCGCG CATTCGACTC TGGTGCTGAA CGACACCTCC TCGGCGCGGA TGCTGAAAGG GAAAATCTCA CGCACCCTGC TCGGCCCCCG CGTGATGGAG GGCCCGGTCG AGGTCGAAAG CCGCCGCAAC GAGAACGAGG CGGGTGTCTG GCTCGATACC GGCCATGACG GCTATGCGGG GCTTTTCGGC CTTATCCATC GCCGCCGCCT GTTTCTCTCG GCGACCGGCG AGGATCTGCG CGGCGAGGAC ATGCTGGAAA CGGCCCAGGC GCGCCGCCGC CCGAAGCCCT GGAACCCGCT CTACTGGCAC AAGGTGCCGG AAGACCCGGA CTTCGCCATC CGCTTCCACA TCCACCCCGA TGCGCGCGTT TCGCTCGCCC ATGACCGCAG CAATGTGCTG GTGCTGCTGC CGAACGGCGA TGGCTGGCAA TTCCGCGCCC GCTCCGGCGG CGGCGAATGC GAAATCGACA TCGAGGAAAG CGTCTATCTC GGCTCCGGCG ACACGACGCG CCGCGCCGAG CAGATCGTCG TGACGGGCCG CGTCTTCCGC GGCGAGGCGC GCGTCAACTG GGCCTGGCGG 
CGGCTTTCGA CCCGCAATGC CGGGCACCCG AAACGCGAGG ACGCGGAAGT GCCCGAACTG CCGGAGCTGG AATTTCCGGC GGGGGAGTGA
|
Protein sequence | MIETRTGRDG RGAGPRTRPA PGNRAIDLAG AALIRTRDAA LARIFSTWAY GQTLHGQMPD HMVLYPPELR PGRASAADAL FQGRWFLPGG QVRAPGGSSP FAADPPSERW AEELHGFSWL RHFTTAHEAN NGDAARQYAQ KLVADWIATE GNWHPVAWRP QVIGRRLISW VANGALIIDG TELVYRSTLL RNMARQARHL ARVAAIAPAG EPRITAAMGL AFSGLCLSEG HKRLTKGVQL LCRELDRQIL ADGGHVSRNP SAQLSILLDL LSLRDALTAR NIEVPKPVRD AIDRMTPMLR FFRHGDGRLA LFNGSTEGPE GAVDAALDRD DTKGKPFGFA PHSGYQRLAA GNVNLIADTG TAPPGAYSHE AHAGCLSFEM SVGRNRMIVN CGATKVLGPD WEAASRATAA HSTLVLNDTS SARMLKGKIS RTLLGPRVME GPVEVESRRN ENEAGVWLDT GHDGYAGLFG LIHRRRLFLS ATGEDLRGED MLETAQARRR PKPWNPLYWH KVPEDPDFAI RFHIHPDARV SLAHDRSNVL VLLPNGDGWQ FRARSGGGEC EIDIEESVYL GSGDTTRRAE QIVVTGRVFR GEARVNWAWR RLSTRNAGHP KREDAEVPEL PELEFPAGE
|
| |
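The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not special-case the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence listed above.
first_30 = "ATGATCGAGA CGCGGACCGG ACGGGATGGA"
print(translate(first_30))  # prints: MIETRTGRDG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.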