Gene Information |
Locus tag | TM1040_0090 |
Symbol | |
ID | 4078756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 92456 |
End bp | 94201 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005377 |
Product | heparinase II/III-like |
Protein accession | YP_612085 |
Protein GI | 99079931 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.379483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAAT ATGACCGAAT GGCCAGTCGC GGCACGCGAC TGCTGAACCG CTATACCGCC TGGAAGGCGC GCAAACAGCC CGCGGCCACT GGGTTTGTCT CGCAGCCCGA GCCGCGTACC ATAGGCAGTT TTGCGCGTGG GCGGCAGCTG GTGGCCGGCA ACCTCCTGTT TGCAGGCTAT CTGGTAGAGA GTGACACCAC TGGCCTTTGG GATGTGGAGG CGCCTGATTT TGCCTTTGAG GCAGAGCGCC AGGGCTGTAC ATGGCTCGAT GATCTGGCCG CGGTCGGCGA TCTGAAGGCG CGCAGCAAGG CCCAGCACTG GGTCCGGGGC TGGATCGATG AGTTTGGCAA GGGCACTGGT CCCGGCTGGT CGCCGGATCT GACCGGGCGG CGCGTGATTC GATGGATCAA CCACGCCCTA TTCCTGCTGA GTGGTCAGGA CAAACCAGCC TCCGATGCCT TTTATCGGTC TCTCTCGCAG CAGACGTGGT TCCTGTCGCA GCGCTGGAAG GGGGCATCGC CTGGCTTGCC GCGGTTTGAG GCGCTTACCG GGCTGATCTA TGCGGGGCTT GCGCTCAAGG GCTGCGAGGA ACTCGCAGAC CCTGCGGTCA AAGCGCTGGC TCAGGACTGC GCGCAGCAGA TCAACGCCGA GGGCGGCCTG CCGACCCGCA ACCCCGAAGA GCTACTGGAT GTGTTCACCT TGCTCACATG GGCCGCAGCG GCGTTGCATG AAGCGGGACG GTCGGTGCCG CGCGAGCATC ACGCCGCGCT GGATCGTATC GCACCCACAC TGCGGGCCTT GCGCCACAGC GACGGGGCGC TGGCGCGGTT TCATGGCGGC GGGCGCGGGC AGGAGGGCTG GCTCGATCAC GCGCTGGCCG CCTCGCATGT CCCTGCAAGA CCCTTTGAAG GGCTGGCGAT GGGGTTTGCA CGACTCTCGG CGCGGCGCAC TTCGCTCATC ATTGATGCCA CCGTGCCACC CGTTGGCAAG GCCTCCTACA ATGCACACGC TTCGACTTTG GCTTTCGAGC TAACGTCCGG GCGGCGTCCC TTGATCGTGA ATTGCGGCGC GGGCGAGAAT TTCGGGCTAG AGTGGCGTCG GGCGGGACGG GCCACGCCCT CTCATTCTGC GCTCTGTATC GAAGGGCACT CAAGCGCCCG GCTGGCCGCG CCGCAAAAGG GCACGGGACA TGAGTTCCTG ATCGACGCGC CCACCGATGT GCCAATCGAG CGCGAAGACC TTGTGGATGG GTATCGGTTT CAGGGTGCTC ATGATGGCTA TGCCAAATCC TATGGCGTGA CCATTGCGCG CTCTTTGGAA CTGTCGGTGG ACGGGCGCAT GGTGTCGGGC GAGGACATGG TGCTGGCACT TGATGACGCC GCAAAAAAGT GCTTCGACAG GGCGCTGGAC GCGGGCGGCC TGCGCGGTAT TGGCTATGAT TTACGGTTCC ATTTGCACCC GGATGTGGAC GCAGCCCTTG ACTTAGGGGG CGCAGCAGTA TCCATGGCGC TCAAGAGTGG GGAAATCTGG GTATTCCGTC ACGATGGTCA ATGCGACCTC AAGCTGGAAA CCAGCGTTTA CCTGGAAAAG GCCCGCTTGA AGCCGCGTCA ATCGCTGCAA ATCGTCCTGT CGGGCCGGGC CATTCAATAT GCGACCCAGA TCCGCTGGAC CCTCAGCAAG GCGCAGGAAA CGGCTGTGGC CGTGCGCGAC TTGGCCCGCG ACGACCCCAT GGCCTACGAA GAGTGA
|
Protein sequence | MSKYDRMASR GTRLLNRYTA WKARKQPAAT GFVSQPEPRT IGSFARGRQL VAGNLLFAGY LVESDTTGLW DVEAPDFAFE AERQGCTWLD DLAAVGDLKA RSKAQHWVRG WIDEFGKGTG PGWSPDLTGR RVIRWINHAL FLLSGQDKPA SDAFYRSLSQ QTWFLSQRWK GASPGLPRFE ALTGLIYAGL ALKGCEELAD PAVKALAQDC AQQINAEGGL PTRNPEELLD VFTLLTWAAA ALHEAGRSVP REHHAALDRI APTLRALRHS DGALARFHGG GRGQEGWLDH ALAASHVPAR PFEGLAMGFA RLSARRTSLI IDATVPPVGK ASYNAHASTL AFELTSGRRP LIVNCGAGEN FGLEWRRAGR ATPSHSALCI EGHSSARLAA PQKGTGHEFL IDAPTDVPIE REDLVDGYRF QGAHDGYAKS YGVTIARSLE LSVDGRMVSG EDMVLALDDA AKKCFDRALD AGGLRGIGYD LRFHLHPDVD AALDLGGAAV SMALKSGEIW VFRHDGQCDL KLETSVYLEK ARLKPRQSLQ IVLSGRAIQY ATQIRWTLSK AQETAVAVRD LARDDPMAYE E
|
| |
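
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(92456, 94201)      # Start bp, End bp from the record
    print(length, protein_length(length))   # compare with Gene Length and Protein Length
```

Under those assumptions, the values agree with the Gene Length (1746 bp) and Protein Length (581 aa) fields. Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content.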