Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1802 |
Symbol | |
ID | 4076948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1893699 |
End bp | 1894817 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638007117 |
Product | hypothetical protein |
Protein accession | YP_613797 |
Protein GI | 99081643 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4335] DNA alkylation repair enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.462091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0113105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGT CGTTTTCGCT GAAGGATCAG CTTTTTAACG TGGAAAAGAC CCGCTATCTG GCGGGGCTTT TTGACGCGGC TTCGGTGGAG TTTGACCCGC GCGCCTTTGA GGCGGATGTG ATGGCGCGGC TCCTTGATCT TGAGCTCAAG GCGCGCATCA ACTGGATCGC CGAGATGCTG TCAAAGCATG TCCCGGGTCC GCTCGATCAA GTTGCACCGG TGATTTTTGC GGCTCTGCCG CCACCTTTGG ACCCAAGCCT GCGCGATGAT GACTTTGGCG ATTTCATCTT TGCTCCGCTT GGGGAATGGG TTGCGGATCT CACCAAGACC GAGGCCGATC TGCCTCTGGC GCTGGATCTA CTGGAGGCCG TAACACAGCG TTTTTCCATG GAGTTTGCGA TCCGCCCGCT GTTGAAAACC TGGCCCGATC CTGTGCTTGC GCGCATGTCG CGTTGGGCCG GGCACGCGCA TTATCATGTG CGAAGGCTTG CCAGTGAGGG CACGCGCCCG CGGCTGCCCT GGGGCCTTGC GGTAAATCTG CCGTTGGATG CGCCCTTGCC GATCCTAGAC CGTCTTCACG GCGATGACAC ACGTTTTGTC ACACGTTCGG TGGCCAATCA CTTGAATGAC ATCGCCAAAA AAGACCCGCA GATCGTGGTG GACCAACTGA CCGCGTGGCA AGCGCGCGGC GAGCAGGCCC AAAAAGAGCT GGACTGGATG ACCTCCCATG CGCTGCGTGG CCTGATCAAG GCGGGCGATC CCCGCGCGCT GCGACTGCTT GGGTATGATC CGGAACTTGA TCTCTCGGCA GAGCTTGAAC TGCCCGGACG CGTGCGGATC GGTGAAAAAC TGATGTTGGG CGCGCGGCTG CAGGGGGGCA GGGGCGCGCG GGTGCTGGTG GATTATGCCC TGACCTTTCA ACGGGCTGGT GGCAAGACCT CAACCAAGGT GTTCAAGTGG AAGACCGGCA CGCTGGGCGC TGACGGCCTG AGCTTGCAAA AGACGCATCC GCTTAAGGCG CAGGCCTCGA CCTTTACGCT GTTGCCGGGG GCGCATCGGG TGACGCTGAT GGTCAATGGC CAGCCCCGGG CAAGCGGCGA GGTGGAGTTC CTTGCCTGA
|
Protein sequence | MAESFSLKDQ LFNVEKTRYL AGLFDAASVE FDPRAFEADV MARLLDLELK ARINWIAEML SKHVPGPLDQ VAPVIFAALP PPLDPSLRDD DFGDFIFAPL GEWVADLTKT EADLPLALDL LEAVTQRFSM EFAIRPLLKT WPDPVLARMS RWAGHAHYHV RRLASEGTRP RLPWGLAVNL PLDAPLPILD RLHGDDTRFV TRSVANHLND IAKKDPQIVV DQLTAWQARG EQAQKELDWM TSHALRGLIK AGDPRALRLL GYDPELDLSA ELELPGRVRI GEKLMLGARL QGGRGARVLV DYALTFQRAG GKTSTKVFKW KTGTLGADGL SLQKTHPLKA QASTFTLLPG AHRVTLMVNG QPRASGEVEF LA
|
| |