Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1599 |
Symbol | |
ID | 4078408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1710889 |
End bp | 1711800 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638006912 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_613594 |
Protein GI | 99081440 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0042965 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.464742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAATG CGTACCTTCA AAGCCTCCAC ACCAGCGCGT CTCTTCGGCA GCTCTTGCAG TTGCGTCCCT TTGAGGCGCC CGTGTCATCT GAAATGGGCG ATAGCCGGAT CGAGTTGGAG GCCGAGAACG GAACGCGCCT TGTTCTTGAC GGGTCGTTCT CTTCCGAGGA CGAAGGGAGC TGGGTCATTT ATGCTCTGGC GCTCTATGAC GGGGCCGACA CGCTGGTGTC GATCTCGGAC ATGGCGCTCT CCTACGAGGA TTTTTTTGCA GCCAATGGCA GAGAGCGCCT GAACATGGTG TTCGAAGGCG CCGATGAGAT CACGTCTAAT CTCGGCTCTG GTCTTCGTAT GAATTTTCAA GCCGGCGATG ACGTTATCAC CCTTGGCTTC GGTGATGACG TGATCAACGG CGGTCGTGGC AACGACACGG TGTCTGCCGG GGCAGGGAAT GACACGATCA ACGGCAACGC GGGCAGAGAC CAGATCAATG GCCATGACGG AGACGACGTC CTGACGGGCG GCGGCGGTCG CGACAAGCTA AACGGTGGTG TTGGAAACGA CGTTCTGTCC GGGAACGCGG GACGCGACAA GCTGGTCGGT GGCACTGGCA ATGACACGCT CTCTGGCGGT GCGGATGATG ATGTCCTGCG CGGTGGAACC GGCGATGACG TTCTGACCGG CGGCGCGGGC GCAGATGTGT TCAAGTTTCG GGCCAATGAC CACACCAATG TGATTACAGA CTTCGAAGTT GGCGTCGATC ACATCGAGGT CCTCAAAGGC GCGCGCGGTA TGCGTGGCGT TGATTTTGAG CAAATCGGCG ACGATGTGGC GGTCTATTTC GGCAACGTAA CCGTGATTGT CCAAGACACC ACGGTCGAGT TGATGGACGA CAGCGACAAC TTCTTGTTCT GA
|
Protein sequence | MANAYLQSLH TSASLRQLLQ LRPFEAPVSS EMGDSRIELE AENGTRLVLD GSFSSEDEGS WVIYALALYD GADTLVSISD MALSYEDFFA ANGRERLNMV FEGADEITSN LGSGLRMNFQ AGDDVITLGF GDDVINGGRG NDTVSAGAGN DTINGNAGRD QINGHDGDDV LTGGGGRDKL NGGVGNDVLS GNAGRDKLVG GTGNDTLSGG ADDDVLRGGT GDDVLTGGAG ADVFKFRAND HTNVITDFEV GVDHIEVLKG ARGMRGVDFE QIGDDVAVYF GNVTVIVQDT TVELMDDSDN FLF
|
| |