Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3219 |
Symbol | |
ID | 4075361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 215137 |
End bp | 216363 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004728 |
Product | Rieske (2Fe-2S) region |
Protein accession | YP_611455 |
Protein GI | 99078197 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.201195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGCA ATATCAACAC GCTGATCGCT AACCACCGCG CAGGACATGC ACTTGATCAG GCGTTTTATA CCGATGCCGA GGTGTTTCAG ACCGATCTTC AGGAGATCTT TTACAAAGAA TGGCTTTTTG CCATTCCCGC CTGCGAGCTG GACAAGCCAG GGAGCTACGT CACCCATCAG GTTGGCAACT ACAATGTGAT CATCGTGCGC GGTGCAGACA ATGTCATTCG GGCTTTCCAC AATGCTTGTC GTCACCGTGG CTCGGTGATC TGCAAGGCGA AGAAAGGCAA CAACCCTAAG CTCGTCTGCC CCTATCACCA GTGGACTTAT GAACTGGACG GTCGTCTGCT GTGGGCGCGT GATATGGGGC CTGATTTCGA GCCGAGCAGA CATGGGCTCA AGACGGTCCA CTGCCGTGAG CTTGCTGGGT TGATCTATAT TTGTCTCGCC GATGAGGCCC CGGATTTTGA ACGGTTTGCC GAGGTCGCCC GCCCCTATCT GGAGGTTCAT GACCTCTCGA ACGCCAAGGT CGCCCATGAA AGCTCCATCG TGGAGCGCGG CAACTGGAAG CTGGTCTGGG AGAACAACCG CGAGTGCTAC CACTGCGGCG GCAATCACCC CGCGCTCTGC CGGACCTTCC CGGATGATCC CTCCGTGACG GGCATCGAAG GTGGCGAGAC CCCGAGCAAT TTGCAGGCTC ATTTCGACCG CTGTGAGCAG GCTGGGATGC CTTCGGGGTT CCACCTCAGC GGTGATGGCC AGTTCCGTGT CGCGCGCATG CCCCTGAAAG AAGGCGCTGA GAGCTACACG ATGGACGGCA AGACCGCCGT GCGTCGCTGG CTGGGCCGTG CAGCCTTTGC GGATGCGGGC TCGTTGCTCA AGTTCCACTA CCCGACCACT TGGAACCACT TCCTGTCGGA CCATTCGATC GTGTTCCGGG TCACGCCCAT CAGCCCCACG GAAACCGAGG TGACGACAAA ATGGCTGGTT CACAAAGACG CGGTTGAAGG TGTGGATTAC GATCTACAGC GGCTCACCGA GGTTTGGATT GCCACCAATG ACGAAGACCG CGAGGTTGTG GAGTTCAACC AGATGGGGAT CAACTCGCCG GCCTATGAAC CGGGGCCCTA TTCCCCGACC CAAGAGAGCG GCGTCCTGCA ATTTGTGGAG TGGTATCTCT CTACCCTCAA ACGCAACAGC GGCCCACACG CCGTCGCAGC GGAGTGA
|
Protein sequence | MHSNINTLIA NHRAGHALDQ AFYTDAEVFQ TDLQEIFYKE WLFAIPACEL DKPGSYVTHQ VGNYNVIIVR GADNVIRAFH NACRHRGSVI CKAKKGNNPK LVCPYHQWTY ELDGRLLWAR DMGPDFEPSR HGLKTVHCRE LAGLIYICLA DEAPDFERFA EVARPYLEVH DLSNAKVAHE SSIVERGNWK LVWENNRECY HCGGNHPALC RTFPDDPSVT GIEGGETPSN LQAHFDRCEQ AGMPSGFHLS GDGQFRVARM PLKEGAESYT MDGKTAVRRW LGRAAFADAG SLLKFHYPTT WNHFLSDHSI VFRVTPISPT ETEVTTKWLV HKDAVEGVDY DLQRLTEVWI ATNDEDREVV EFNQMGINSP AYEPGPYSPT QESGVLQFVE WYLSTLKRNS GPHAVAAE
|
| |