Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0392 |
Symbol | |
ID | 8418197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 479407 |
End bp | 481080 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645036956 |
Product | Heparinase II/III family protein |
Protein accession | YP_003197270 |
Protein GI | 258404528 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.276932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.686057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCGAC GTTTTCCTGC CATGGATATT CAAACATATT TCCACACCCT TCGGCATCTC AAAGCCGGCC AAATCTATGC CCGTTTGATC CGTAAGCTCC GCCGTCCCCA AATCGATTCC CGTCCGGCGC CGGACATTCG TACCGGGGGG CGGCAAGATT GGGTGCAGCC GGTCAAGCGG CCCCAGTCCA TGCTGGGGCC GGCGACCTTT TGCTTTTTGA ACCAGACCCA CACCTTGCAC TGGCCGCAGG GTTGGAACGA TGCGGTCGCG GACAAATTGT GGCTCTACAA CCTGCACTAT TTTGATGACC TCCATGCCCA GGGCGCAGAA GAGCGCACCG CATGGCACCA CCGGCTGCTT TTGCGCTGGG TGGACGAGAA CCCGCCTGGC AAGGGCAACG GCTGGGAGCC GTATCCCACC TCGCTTCGAG TTGTGAACTG GGTCAAGTGG GCCTTGGCCG GGAATACGTT GCCGGAAGCC GCGGTGCACA GCCTGGCGGT GCAGGTCCGC TGGTTGGGCA AGCATCTGGA ATACCATCTT CTGGGCAACC ACTTGTTTGC CAACGCCAAG GCCCTGGTTT TTGCCGGGGC TTTTTTTGAG GGCCAGGAAG CCCGCCAATG GATGGACAAG GGCCTGGCCA TTTTGGCCCG GGAAATCCCG GAGCAGATTC TGCCAGACGG GGGGCATTTT GAGCGCTCGC CCATGTACCA CGCCATCGCT GTGGAAGATA TGGCGGATCT GGTCAATATC TGCCGGGCCT ATCCGCACAG CCTGCCGCAG CAGGGCCACG AGCTTGTCCG CTCTTGGCCG GATACCGTGC AGCGGATGCT GGAGTGGTTG CAGGCCATGT GCCATCCGGA CGGCAATATC GTGCTGTTGA ACGATGCCGC CTTGAATATT GCCCCTGGGC CAAAGCAGCT GGCCCAATAC TGCCAGCGGC TGGGGATTAG GCCGGCCAGC CGGGAGAGCG GGGAGCGGTT GCGGGTGACG CAGTTGCCGG ACACCGGCTA TATCCGGGTT GACGCCGGGG AGGCGGTTGC CTTTTTAGAC ACCGCCCCCA TTGGGCCGGA TTATCTGCCG GGGCATGCGC ATGCGGACAC ATTGACGTTT GCCCTGTCTG TGTGGGGGCA GCGGCTGATC GTGGACTCCG GGGCGTCATG CTACGGGCTG GGTCCGGAGC GGTTGCGGCA GCGCTCGACC GCCGCGCACA ACACCGTGGT CGTGGACGGG AAGGATTCCT CCGAGGTCTG GTCGGGCTTT CGGGTCGCTC GGCGGGCGTA CCCCAAGGGA TTGACCGTGC GGCAAGACCA CGAGGTGGTC ACGGTGCAGT GCAGCCATGA CGGGTATACG CGGCTGCCGG GGAGCCCTGT GCATACCAGG CGCTGGGAGT TTACTCCGGG CAAGCTGGGC ATGCAGGACA GTATTTCCGA TGCACAATCC GGGGCGCAGG CCCGCTTGCA TCTGCATCCT GCTTGGCACT GCCGACGTGT GGACGGGCAT ACGGCTGACC TGCATCTGGC CCCGGGAAAA CGGGTGCGGG TGGCCATTCG GGGCGGGCAG ATGACAATGG AAGAGAGCAC ATGGCATCCA GAATTTGGCG CGTCGTTGCC GAACATCTGT CTTGTGCTGG ATTTTTACGG GCAATTGTAT ACGGAGATTG GCTGGGACGT ATGA
|
Protein sequence | MGRRFPAMDI QTYFHTLRHL KAGQIYARLI RKLRRPQIDS RPAPDIRTGG RQDWVQPVKR PQSMLGPATF CFLNQTHTLH WPQGWNDAVA DKLWLYNLHY FDDLHAQGAE ERTAWHHRLL LRWVDENPPG KGNGWEPYPT SLRVVNWVKW ALAGNTLPEA AVHSLAVQVR WLGKHLEYHL LGNHLFANAK ALVFAGAFFE GQEARQWMDK GLAILAREIP EQILPDGGHF ERSPMYHAIA VEDMADLVNI CRAYPHSLPQ QGHELVRSWP DTVQRMLEWL QAMCHPDGNI VLLNDAALNI APGPKQLAQY CQRLGIRPAS RESGERLRVT QLPDTGYIRV DAGEAVAFLD TAPIGPDYLP GHAHADTLTF ALSVWGQRLI VDSGASCYGL GPERLRQRST AAHNTVVVDG KDSSEVWSGF RVARRAYPKG LTVRQDHEVV TVQCSHDGYT RLPGSPVHTR RWEFTPGKLG MQDSISDAQS GAQARLHLHP AWHCRRVDGH TADLHLAPGK RVRVAIRGGQ MTMEESTWHP EFGASLPNIC LVLDFYGQLY TEIGWDV
|
| |