Gene TM1040_3721 details

Gene Information

Locus tag: TM1040_3721
Symbol:
ID: 4075428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 781124
End bp: 782383
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 59%
IMG OID: 638005241
Product: cytochrome P450
Protein accession: YP_611950
Protein GI: 99078692
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
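
The start and end coordinates, the gene length, and the protein length listed above agree with one another. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 781124
end_bp = 782383

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1260
print(protein_length)  # 419
```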


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.540203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCT GGACCCCGAC CGATGACGGA TATGCGGATC TGTCGAGCCA TGACGCCTTT 
GCCAATGGCG CGCCGCACAA CACCTTTGCC CGGCTCCGGC GTGATGATCC GCTGCATTGG
ACCGAATACA GCGATGGCGA GAATTTCTGG TCGGTCACGC GCTATGACGA CATCACCAAG
ATGAACAAGA ACACAGAGAT CTTTTCCTCG GCGCGCGGCA TCCGCATGGA GGATCAGACC
TACGAGGAAT ACCTCGCGCG GCGCACTTTT CAGGAAACCG ACCCGCCCGA ACATTCTCAG
GTGCGCATGA AGCTCCTGAA GGCGTTCTCC AAGACCACCA TGGCGCAATA CGAGGCGGAC
ATTCGCGACC TTTGTGCGGA GATCCTCGAC GAGGCCTTGG CAAAAGGCAG CTTTGACGCC
ACCAAGGAGA TCGCCAGGCA GTTGCCCATG CGGATGCTCG GGCGCGTCGT CGGCTTGCCG
GACGCGGATC TGCCGTGGCT GGTGGAGAAG GGGGATGCGC TCATTGCGAA TACTGACCCC
GATTTCACCT CGCATGTGCT GGACAAAATG GACACGGATG AATTCCGCAT GATGCCCTTC
AACTCTCCGG CGGGTGCAGA ATTATACATC TACGCCAAGG AGTTGATGGA AGCCAAAGAG
AGGGCAGGTG ACACCTCCGG CGTGCTCAAC ATGATTCTGC AGCCGGCCCG AGACGGATCG
GTCATTACCG AAACCGAGTT TCGCAATTTC TTCTGCCTCT TGGTGGCTGC GGGCAACGAC
ACCACGCGTT ACTCCATCGC GGCTGGCATT CAGGCGATGT GTCATCAGCC GGAGCTTCTG
GCGCAGATGC AGGCGGGCGG GGAGATTTGG GAGACGGCGG CGGATGAGAT CATCCGCTGG
GCGACGCCTG CGCTCTATTT CCGCCGCACG GCGACGCAGG ATGTCGAGAT GCATGGCAAG
ACCATCCGCG AAGGCGATAA GGTGCTCTAT TGGTTTGCCT CTGCCAATCG CGACGACAGC
TATTTTGACG ACCCGTTTCG GGTGAACCTG ATGCGCAATC CCAACCGGCA CCTGTCTTTC
GGCCAGTTCG GCCCGCATGT CTGCCTGGGC ATGTGGCTCG CACGGCTTGA GGTCACGGTT
CTGTTTCAGG AACTCTCCAA GCGGATCAAA TCCATCGAAC CCAATGGTGC CCACAAATTC
CTGCGGTCGA ACTTTGTCGG CGGCATCAAG GAATTGCCCG TGCGGGTAGA AGCCGCGTAG
 
Protein sequence
MTIWTPTDDG YADLSSHDAF ANGAPHNTFA RLRRDDPLHW TEYSDGENFW SVTRYDDITK 
MNKNTEIFSS ARGIRMEDQT YEEYLARRTF QETDPPEHSQ VRMKLLKAFS KTTMAQYEAD
IRDLCAEILD EALAKGSFDA TKEIARQLPM RMLGRVVGLP DADLPWLVEK GDALIANTDP
DFTSHVLDKM DTDEFRMMPF NSPAGAELYI YAKELMEAKE RAGDTSGVLN MILQPARDGS
VITETEFRNF FCLLVAAGND TTRYSIAAGI QAMCHQPELL AQMQAGGEIW ETAADEIIRW
ATPALYFRRT ATQDVEMHGK TIREGDKVLY WFASANRDDS YFDDPFRVNL MRNPNRHLSF
GQFGPHVCLG MWLARLEVTV LFQELSKRIK SIEPNGAHKF LRSNFVGGIK ELPVRVEAA
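
The protein sequence corresponds to the gene sequence translated with translation table 11, whose amino acid assignments match the standard genetic code. The sketch below is one way to reproduce the translation and the GC content from the space-separated sequence blocks shown in this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and are not part of any database tool.

```python
from itertools import product

# Amino acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGATCT GGACCCCGAC CGATGACGGA TATGCGGATC TGTCGAGCCA TGACGCCTTT"
print(translate(first_line))  # MTIWTPTDDGYADLSSHDAF
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content reported in the record.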