Gene TM1040_1351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1351 
Symbol 
ID4076368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1441498 
End bp1442535 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content59% 
IMG OID638006661 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_613346 
Protein GI99081192 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCCA TGTGCACCGG AACAGAATAC GAGACCGAGA TGCCCCATAT CAGCGCTTCC 
TTTCCTGCCT CCCGCCCCCG TCGCCTGCGT GCCTCTGCCG CTTTGCGCGA CCTCACGCGG
GAGAACGAAC TCTCGGTGAA TGACCTGATC TGGCCGGTTT TTGTACGCGA TGGGGAGGGG
ATCGAGGAGC CGGTTCCCTC GATGCCAGGC GTGGTGCGGC GCTCTGTCGA CAAGGTGGTT
GAGGCCGCCG TCGAAGCGCA GGCACTCGGA ATTCCGGCGA TCTGCCTCTT TCCCTACACG
GATCCGTCTT TGAAAACAGA GGATTGTGCC GAGGCTTGGA ACCCGGAGAA CCTCTGCAAT
CGGGCCATCC GTGGGATCAA GGCCGCGGCG CCCGATCTCG CCGTGATGAC CGATGTCGCG
CTAGATCCCT ATAATATCAA CGGCCACGAC GGCTTTGTGA TTGATGGCGA AATTCGCAAC
GACGAAACCG TCGAGGCGCT GGTCAAGATG ACCCTCGCAC AGGCCGAGGC CGGGGCGGAT
ATCATCGGCC CCTCTGACAT GATGGACGGG CGCATCGGAG CCATGCGGTC TGCATTGGAA
AGGAAGGGAT TTCAGAATGT TACGATCCTG TCCTACTCTG CAAAATACGC GTCTGGATTT
TATGGACCGT TTCGTGATGC GGTCGGCGCC TCGGGGGCCC TGACCGGCGA CAAGAAGACC
TATCAGATGG ACCCCGCCAA CACCAATGAA GCCCTTCGCA TGATTGAACG CGATCTGCGC
GAGGGGGCGG ATATGGTGAT GGTGAAACCC GGATTGCCCT ATCTCGACAT CTGCCACCGC
GTGAAAGAGA CCTTCCAGGT CCCGACCTTT GCCTACCAGG TGTCAGGAGA ATACGCGATG
ATCCAAGCGG CGGCTCTGAA TGGTTGGATC GATGGGGAAA AAGTTATGCT AGAAAGCCTC
ATGGCCTTCC GTCGGGCTGG ATGTGATGGT GTGCTTACCT ATTTTGCGCC ACAGGTCGCG
AAACTGTTGA ACGGCTAA
 
Protein sequence
MQPMCTGTEY ETEMPHISAS FPASRPRRLR ASAALRDLTR ENELSVNDLI WPVFVRDGEG 
IEEPVPSMPG VVRRSVDKVV EAAVEAQALG IPAICLFPYT DPSLKTEDCA EAWNPENLCN
RAIRGIKAAA PDLAVMTDVA LDPYNINGHD GFVIDGEIRN DETVEALVKM TLAQAEAGAD
IIGPSDMMDG RIGAMRSALE RKGFQNVTIL SYSAKYASGF YGPFRDAVGA SGALTGDKKT
YQMDPANTNE ALRMIERDLR EGADMVMVKP GLPYLDICHR VKETFQVPTF AYQVSGEYAM
IQAAALNGWI DGEKVMLESL MAFRRAGCDG VLTYFAPQVA KLLNG