Gene TM1040_3219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3219 
Symbol 
ID4075361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp215137 
End bp216363 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content59% 
IMG OID638004728 
ProductRieske (2Fe-2S) region 
Protein accessionYP_611455 
Protein GI99078197 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.201195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGCA ATATCAACAC GCTGATCGCT AACCACCGCG CAGGACATGC ACTTGATCAG 
GCGTTTTATA CCGATGCCGA GGTGTTTCAG ACCGATCTTC AGGAGATCTT TTACAAAGAA
TGGCTTTTTG CCATTCCCGC CTGCGAGCTG GACAAGCCAG GGAGCTACGT CACCCATCAG
GTTGGCAACT ACAATGTGAT CATCGTGCGC GGTGCAGACA ATGTCATTCG GGCTTTCCAC
AATGCTTGTC GTCACCGTGG CTCGGTGATC TGCAAGGCGA AGAAAGGCAA CAACCCTAAG
CTCGTCTGCC CCTATCACCA GTGGACTTAT GAACTGGACG GTCGTCTGCT GTGGGCGCGT
GATATGGGGC CTGATTTCGA GCCGAGCAGA CATGGGCTCA AGACGGTCCA CTGCCGTGAG
CTTGCTGGGT TGATCTATAT TTGTCTCGCC GATGAGGCCC CGGATTTTGA ACGGTTTGCC
GAGGTCGCCC GCCCCTATCT GGAGGTTCAT GACCTCTCGA ACGCCAAGGT CGCCCATGAA
AGCTCCATCG TGGAGCGCGG CAACTGGAAG CTGGTCTGGG AGAACAACCG CGAGTGCTAC
CACTGCGGCG GCAATCACCC CGCGCTCTGC CGGACCTTCC CGGATGATCC CTCCGTGACG
GGCATCGAAG GTGGCGAGAC CCCGAGCAAT TTGCAGGCTC ATTTCGACCG CTGTGAGCAG
GCTGGGATGC CTTCGGGGTT CCACCTCAGC GGTGATGGCC AGTTCCGTGT CGCGCGCATG
CCCCTGAAAG AAGGCGCTGA GAGCTACACG ATGGACGGCA AGACCGCCGT GCGTCGCTGG
CTGGGCCGTG CAGCCTTTGC GGATGCGGGC TCGTTGCTCA AGTTCCACTA CCCGACCACT
TGGAACCACT TCCTGTCGGA CCATTCGATC GTGTTCCGGG TCACGCCCAT CAGCCCCACG
GAAACCGAGG TGACGACAAA ATGGCTGGTT CACAAAGACG CGGTTGAAGG TGTGGATTAC
GATCTACAGC GGCTCACCGA GGTTTGGATT GCCACCAATG ACGAAGACCG CGAGGTTGTG
GAGTTCAACC AGATGGGGAT CAACTCGCCG GCCTATGAAC CGGGGCCCTA TTCCCCGACC
CAAGAGAGCG GCGTCCTGCA ATTTGTGGAG TGGTATCTCT CTACCCTCAA ACGCAACAGC
GGCCCACACG CCGTCGCAGC GGAGTGA
 
Protein sequence
MHSNINTLIA NHRAGHALDQ AFYTDAEVFQ TDLQEIFYKE WLFAIPACEL DKPGSYVTHQ 
VGNYNVIIVR GADNVIRAFH NACRHRGSVI CKAKKGNNPK LVCPYHQWTY ELDGRLLWAR
DMGPDFEPSR HGLKTVHCRE LAGLIYICLA DEAPDFERFA EVARPYLEVH DLSNAKVAHE
SSIVERGNWK LVWENNRECY HCGGNHPALC RTFPDDPSVT GIEGGETPSN LQAHFDRCEQ
AGMPSGFHLS GDGQFRVARM PLKEGAESYT MDGKTAVRRW LGRAAFADAG SLLKFHYPTT
WNHFLSDHSI VFRVTPISPT ETEVTTKWLV HKDAVEGVDY DLQRLTEVWI ATNDEDREVV
EFNQMGINSP AYEPGPYSPT QESGVLQFVE WYLSTLKRNS GPHAVAAE