Gene Information |
Locus tag | RPD_0718 |
Symbol | |
ID | 4021191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 806405 |
End bp | 808291 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637960907 |
Product | heparinase II/III-like |
Protein accession | YP_567857 |
Protein GI | 91975198 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
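The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 806405
end_bp = 808291
gene_length_bp = 1887
protein_length_aa = 628


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under those assumptions both comparisons hold: 808291 - 806405 + 1 = 1887 bp, and 1887 / 3 - 1 = 628 aa.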
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.79901 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCAG AGCTGGTTCA GACCAAGCTG CGTTGGACCG CAAAGGTCGC CGACGTCTTG CCGAATCTGA AATCTGCCGT CCGCAAGGCG CAGAAAGTCG ATGCGTTCCA TGAACCCGGG GCATTAGCGC GCGTGTCATC CGCTCAACGC AAACGGATCT CGAGCCTGAT CGTGGGCCGC TTCGCGCGGA ACATGGCGGC GCGTGTATCC GGCGGTTCGG CGGCGCTGAC CCGGCTATGG CCCGGCCGCA CCGACCGGCT GATCATCGCC CCGCACGATC TGCGGACCGC CGACGCGACC CGCGCTGCGG AAATCTACGC CGGACGTTTC GTGTTCGCCG GCAAGATCGT GACCTGCCAC GGCCGCTCGA TCTTCGATCT CGAACCGCCG TCGGACGATT GGGAAGCCGC GCTGTTCGGC TTCGGCTGGC TGCGCCATCT GCGCGCCGCC GACACCGCGA TCACCCGCGC CAACGCCCGC TCGCTGGTCG ACGACTGGCT CTCCAATATG GGGCGCAACC GCACGGTGGG GCGGCGCCCG GATGTGCTGG CGCGGCGGGT GATCTCGCTG TTGTCGCAGG CCCCCCTGGT GCTCGGCGAC ACCGACGGAA AATTCTATCG CCGTTATCTG CGCGGCCTGA CCCGCGAGAT TCGCGCGCTG CGCTATGCGC TGCTCGACGT GCCGGATGGC GTGCCGCGGC TGCAGGTGAT GATCGCGCTG TGCTATGCGG CGCTCTGCCT CGCCAATCAG GCGCGCAACA TCCGCGGCGC GACCAAGCGG CTGTCGGAGG AATTGCAGAG CCAGGTGCTG CCCGATGGCG GGCATGTGTC GCGCAATCCC GGCGCGCTGA TCGAATTGCT GATCGATCTG CTGCCGCTGC GGCAGACTTT CGCCGCCCGC AACATCGCCC CGCCGCCGGC GCTGCTCAAT GCCATCGACC GGATGATGCC GATGCTGCGG TTCTTCCGCC ATGGCGACGG CAGCATCGCT TTGTTCAACG GCATGAGCGG CACGCCGTCG GATCTGCTGG CGACGCTGCT CGCCTATGAC GACAGCCACG GCACGCCGAT GCCGAGTATG CCGCATTCCG GCTACCAGCG GATCGACGCC GGATCGACGC TGGTGATCGT CGACGCCGGC CTGCCGCCGC CGCCGAATGT CAGCCAGGAC GCCCATGCCG GCTGCCTGTC GTTCGAATTG TCATCCGGAC AGTGCCGGAT CGTGACCAAT TGCGGAATGC CCGGCACCAA TCGCGAGAGC TGGCGGGCCT TCGCGCGCAG CACGGTGGCG CATTCCACGG TCGCCTGCCA CGATACCTCG TCCTGCCAGT TCATCGAGCG CTCGGCGATG AAGCGGCTGC TGCAGGGCGC GCCGATCGTC AGCGGCCCGA CGCTGATCGA CAACCGCCGC GAGGCGGTTG CGGGCGGCGT GCTGCTGACC ATGTCGCATG ATGGCTACCG CGGAAAACTG GGCGTCCTGC ATCAGCGGGT CATTATGGTG AAGCATGACG GCAGCCGGGT CGACGGCGAG GATACGGTGT CGCCTGCGCA GGGCGGGCGC CACCGCGCGG CGCAGGCCGA CTATGCGGTG CGGTTTCATC TGCATCCGTC GATCAAGGCG ACCCGGCTCG TCGACGCCCA CGGGGTGATG CTGGTGCTGC CCAACCGCGA GGTCTGGACG TTCGAGGCAC TGGACGACAA GGTCGATCTC GAAGACAGCG TGTTCCTCGC CGGCAATGAC GGCCCGCGGC GCACGACCCA GATCGTGATC CGTCAGAATT CGATCGAGGC GCCGAACATC 
CGCTGGAGCT TCATCCGCTC CAACGCCTCG CCGCAGGCGA CCAATGCGCG CCGCAATGCC CGCCGCGAGC CGGAATTGCC GCTGTAA
|
Protein sequence | MPAELVQTKL RWTAKVADVL PNLKSAVRKA QKVDAFHEPG ALARVSSAQR KRISSLIVGR FARNMAARVS GGSAALTRLW PGRTDRLIIA PHDLRTADAT RAAEIYAGRF VFAGKIVTCH GRSIFDLEPP SDDWEAALFG FGWLRHLRAA DTAITRANAR SLVDDWLSNM GRNRTVGRRP DVLARRVISL LSQAPLVLGD TDGKFYRRYL RGLTREIRAL RYALLDVPDG VPRLQVMIAL CYAALCLANQ ARNIRGATKR LSEELQSQVL PDGGHVSRNP GALIELLIDL LPLRQTFAAR NIAPPPALLN AIDRMMPMLR FFRHGDGSIA LFNGMSGTPS DLLATLLAYD DSHGTPMPSM PHSGYQRIDA GSTLVIVDAG LPPPPNVSQD AHAGCLSFEL SSGQCRIVTN CGMPGTNRES WRAFARSTVA HSTVACHDTS SCQFIERSAM KRLLQGAPIV SGPTLIDNRR EAVAGGVLLT MSHDGYRGKL GVLHQRVIMV KHDGSRVDGE DTVSPAQGGR HRAAQADYAV RFHLHPSIKA TRLVDAHGVM LVLPNREVWT FEALDDKVDL EDSVFLAGND GPRRTTQIVI RQNSIEAPNI RWSFIRSNAS PQATNARRNA RREPELPL
|
| |
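As a spot check on the sequences above, the following sketch translates the first 30 bases of the gene sequence and compares the result with the start of the listed protein sequence. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in permitted start codons). The helper `translate` and the variable names are hypothetical, written for this illustration and not part of the record.

```python
from itertools import product

# Codon table in T, C, A, G order; the amino acid assignments are those of
# the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence listed above.
gene_prefix = "ATGCCCGCAG AGCTGGTTCA GACCAAGCTG"
protein_prefix = translate(gene_prefix)

print(protein_prefix)
print("matches record:", protein_prefix == "MPAELVQTKL")
```

The same function could be applied to the full gene sequence; the expected result would be the 628-residue protein sequence, with the terminal TAA codon ending translation.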