Gene TM1040_1455 details


Gene Information

Locus tag: TM1040_1455
Symbol:
ID: 4077752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1553254
End bp: 1554612
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 62%
IMG OID: 638006766
Product: cytochrome P450
Protein accession: YP_613450
Protein GI: 99081296
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
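
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the database.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the final codon is an untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 1553254
END_BP = 1554612

gene_length = feature_length_bp(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1359 452"
```

Under these assumptions the computed values agree with the listed Gene Length (1359 bp) and Protein Length (452 aa).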


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.756102
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.447397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTGC GCCCGCCAAA ACCCCCGGTA CGGCCTGACA GAGTCTCGCT CTGGCGCTAC 
CTGAAGCTGT TTCGGGCCGA TATTCTGTCG GCGCAGCCGC AGCGGCTCTA TCGGGCGTGG
ATGGCAGAGT TTCGCACGCC CTTCTTTCGG TCCTTCCTGG TCAATCAACC TGCGCTTCTG
GATGTGATCC TGAAAGAGCG CCCGGATGAT TTCCCGAAAT CCAACCGCGT GGGCGAGGGG
CTGCGGCCAC TCCTTGGAAA CTCCGTCTTT CTGACCAATG GCGAGACCTG GAAACGGCAG
CGGCGCATCA TTGATCCCGC CTTTGAGGGC GGTCGACTGA AGGAGAGCTT TCCAGCGATG
CGCGCCGCAG CCGAGGCAGG GGTTGCGCGT TTGCGTCCAC ACGCGGATGG GTCAGAACTC
GAGATCGAGG CTGAAGCCTC GCATATCGCG GCGGATGTGA TCTTTCGCAC GCTGTTTTCC
ATTCCCATCG AACATGAGGT CGCCGCAGAG GTCTTTTCCC GGTTCCGCGC CTATCAGCAG
GCGCAGCCGA TCCTCAATCT GGCGGCCTTT GTACCGGTGC CCCGCTGGAT GCCCCGGTTC
TACCCCAAGG GAACCCGACA GAACGCGCGC CATATTCGCA GGCTGATTGC TGATCTGACC
AAGGCTCGGA TGGCAGAGAT TGCCGCGGGC ACAGCACCAG ACGATCTGGC GACCAAGATT
ATGACCACGC TGGACCCGGA AACCGGCAAA GGGTTTGGAG CCGAGGAAAT GGTCGATCAG
GTGGCGATCT TCTTTCTGGC CGGGCATGAG ACCAGTGCCT CGGCGCTGGG GTGGGCGCTC
TATCTGTTGG CGCTTTATCC CGAATGGCAG GAGAAGCTGG CCGCCGAAGT AGCAGAGCAT
GGTGCAGAGG AATTTGCGGA TCTGTCAAAG CTGCGCCTGA CGCGCGATGT GTTTCGCGAG
ACGCTGCGGC TGTATCCACC GGTGCCGATG ATGGTACGCG AGGCAGTTCA GACAGAGAGG
TTCCGGGACC GCGAGGTGCT CAGGGGATCT CAGATGGTGC TCAGCCCTTG GCATCTGCAT
CGCCACGAAC GTCTCTGGGA GCGGCCGGAT GAGTTCGATC CTGGCCGATG GCAGAGCGAG
AACGGAAAAG CCTGTGCCCG GAACGCATAT ATGCCGTTCT CGGCGGGCTC CCGGGTCTGT
ACGGGGGCCG GGTTTGCCAT GGTCGAGGGG GTCTTAATCC TCGCGCAAAT TCTGCGCCAC
TATCGCATCA CGCCTGTCGA AGGTCGGAGC CCCGAGCCCG TTGCGCATCT GACGGTACGC
TCTCGCACGG GCATCTGGCT GCGTTTTTCG CATCGCTAG
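
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. The following sketch shows one way to do that and to check the first and last codons. The helper names are hypothetical, and the start codon set is an assumption: it lists only the most common bacterial start codons, whereas translation table 11 permits additional ones.

```python
# Stop codons of translation table 11; the start codon set is a common
# subset chosen for this sketch (an assumption, not the full table).
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def has_start_codon(seq: str) -> bool:
    """True if the sequence begins with one of the assumed start codons."""
    return seq[:3] in COMMON_START_CODONS


def has_stop_codon(seq: str) -> bool:
    """True if the sequence ends with a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence above (not the full gene).
first_line = "ATGACCCTGC GCCCGCCAAA ACCCCCGGTA CGGCCTGACA GAGTCTCGCT CTGGCGCTAC "
last_line = "TCTCGCACGG GCATCTGGCT GCGTTTTTCG CATCGCTAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)
print(has_start_codon(head), has_stop_codon(tail))  # prints "True True"
```

Applying `join_blocks` to the complete sequence would give a single string whose length can be compared with the Gene Length field.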
 
Protein sequence
MTLRPPKPPV RPDRVSLWRY LKLFRADILS AQPQRLYRAW MAEFRTPFFR SFLVNQPALL 
DVILKERPDD FPKSNRVGEG LRPLLGNSVF LTNGETWKRQ RRIIDPAFEG GRLKESFPAM
RAAAEAGVAR LRPHADGSEL EIEAEASHIA ADVIFRTLFS IPIEHEVAAE VFSRFRAYQQ
AQPILNLAAF VPVPRWMPRF YPKGTRQNAR HIRRLIADLT KARMAEIAAG TAPDDLATKI
MTTLDPETGK GFGAEEMVDQ VAIFFLAGHE TSASALGWAL YLLALYPEWQ EKLAAEVAEH
GAEEFADLSK LRLTRDVFRE TLRLYPPVPM MVREAVQTER FRDREVLRGS QMVLSPWHLH
RHERLWERPD EFDPGRWQSE NGKACARNAY MPFSAGSRVC TGAGFAMVEG VLILAQILRH
YRITPVEGRS PEPVAHLTVR SRTGIWLRFS HR