Gene TM1040_2805 details


Gene Information

Locus tag: TM1040_2805
Symbol:
ID: 4076573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2966243
End bp: 2968249
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 62%
IMG OID: 638008130
Product: hydantoinase/oxoprolinase
Protein accession: YP_614799
Protein GI: 99082645
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
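
The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the record above.
start_bp = 2966243
end_bp = 2968249
gene_length_bp = 2007
protein_length_aa = 668


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record: 2968249 - 2966243 + 1 = 2007, and 2007 / 3 - 1 = 668.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```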


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.726576
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATAT TGCTTGGGAT CGACACGGGT GGGACTTACA CCGATGCGGT GTTGATCCGC 
GATGAGTCCG AGGTGATTGC CTCGGCCAAA TCGCTCACCA CACGCGAGGA TCTTGCGATT
GGGGTCGGCG GCGCGGTCCG CGCGGTGTTG GCGCAGTCCG GTGTTGCGCC CGCGGATATT
TCCATGGCCG CATTGTCCAC TACGCTTGCA ACCAACGCAT TGGTTGAAGG GCAGGGCGGG
CGCGTGGCTT TGATCTACAT CGGCTTCAGC GCGCAGGATC TGGAGCGTCA CGGCCTGGAC
ACTGCGCTCA AGGGGGATCC GGTCCTGCTC CTCGCTGGTG GCCACACCCA TGCAGGTTCC
GAAGCAGCTG TGTTGGACAA AGAGGCTCTT ATCGCCTTTT TAGAGACTGA ATCTACTGGT
GTGAGCGGCT TTGCCGTTGC GGGCGTTTTT GCGACACGCA ATCCGGCCCA TGAATTGGAA
GTCGCGCGCC TCATTCAAGA GCACACCGGC GCGCCTGTAA CCTGTTCGCA TCAGCTCTCG
GCAAAGCTCA ATGGGCCCAA GCGGGCGCTG ACGGCCGTTC TGAATGCGCG GCTGATTGGG
ATGATCGACC GGTTGATCGG TCGTGCTGCG GATACGTTGA CGGCGCTGGG GGTGGAGGCG
CCGATGATGG TGGTTCGGGG TGACGGTGCC CTGATGTCCA CGGACCAGGC CCGCGCACGC
CCCATCGAGA CGATCCTCAG CGGGCCTGCC GCCTCCATTG TCGGAGCGCG ATGGATGACT
GGAGCAGAAA ATGCGTTGGT CAGCGATATC GGTGGCACCA CCACCGATGT TGCCTTGATC
GAACGCGGCC TGCCAAAGAT CGATCCGGAC GGTGCAGAAG TTGGCGGATA TCGCACCATG
GTCGAGGCAG TCGCCATGCG CACCACGGGG CTTGGAGGCG ACAGCGAAGT GCATTTGCAG
CGGGAAGGGC TGCGTGGTGG CGTTGCGCTG GGGCCGCGCC GGGTCTTGCC GATCTCCCTG
ATCGCCACGC AGGCGCCGGA AACCGTTCAT GCGGCGCTCG ACACACAGCT TCATGCGCAG
GTGATCGGGG AATATGACGG GCGTTTCGTG CGTGGCGTGA CCGGGCAACA AGCGCATGGT
CTGAGTGCGC GTGAGGAGGC TCTGTTGTCT CGGATCGGAC AGGATGTCCG TCCGCTTGGC
GAGGTGCTGC GATCGCGCGT GGACCAAAGT GCCCTGAGCC GACTGATCGA CCGTGGGTTG
GTGCAAATGG CCGGTGTGAC GCCTTCGGAT GCGAGTCACG CTCTCGGGCG GGTGTCGACC
TGGGATAATG AGGCGGCACA CAAGGCGCTG ACAATCTTTG CCAAGCGCCG CGTCGGGAGC
GGCGACATGG TTGCAAAGGA TGCGGACACG CTGGCCCAAA TGATCGTGGA TCAACTTACC
GAACAGACTG CATTGACCTT ATTAGAGTCT GTTTTCTCCG AAGAACGGGC TCTGTTTGGT
GGCAGTCCGT CTGATCTGGC GCGGCATGTT CTGCTGCAAC GGGGGCTTCA GGGGCACACC
GGGCTTGTGG CGCTTGATGC GCGGGTGAAT GTGGATGTGA TCGGATTGGG CGCATCAGCA
CCTAGTTATT ACCCGGCGGT GGGGGAGCGT CTACGCTGCA ATATGATCCT GCCTGAACAC
GCTGGTGTTG CAAACGCAAT CGGCGCCGTG GTCGGACGGA TTGTGCAGCG CCGTTCGGGC
ACTGTGACAG CCCCGTCCGA GGGTCGCTTT CGTGTACATC TGGCTGATGG CCCCGTGGAT
TTCACCGAGA GCGAAGCAGC ACTGACGGCG CTAGAATACG CCCTGCGCGC CGAGACCGTT
GCAGCCGCAG AGGAAGCCGG GGCAGGAGAT ATCCATGTGC ATGTGACCAA GGATCTGCGC
ACCGCAGAGG TTGAGGCGCG GCAGGTGTTT GTCGAGGCAA CCCTCACCGT AGAAGCAAGC
GGTCGCCCGC GTGTGGCACA TGCTTAG
 
Protein sequence
MAILLGIDTG GTYTDAVLIR DESEVIASAK SLTTREDLAI GVGGAVRAVL AQSGVAPADI 
SMAALSTTLA TNALVEGQGG RVALIYIGFS AQDLERHGLD TALKGDPVLL LAGGHTHAGS
EAAVLDKEAL IAFLETESTG VSGFAVAGVF ATRNPAHELE VARLIQEHTG APVTCSHQLS
AKLNGPKRAL TAVLNARLIG MIDRLIGRAA DTLTALGVEA PMMVVRGDGA LMSTDQARAR
PIETILSGPA ASIVGARWMT GAENALVSDI GGTTTDVALI ERGLPKIDPD GAEVGGYRTM
VEAVAMRTTG LGGDSEVHLQ REGLRGGVAL GPRRVLPISL IATQAPETVH AALDTQLHAQ
VIGEYDGRFV RGVTGQQAHG LSAREEALLS RIGQDVRPLG EVLRSRVDQS ALSRLIDRGL
VQMAGVTPSD ASHALGRVST WDNEAAHKAL TIFAKRRVGS GDMVAKDADT LAQMIVDQLT
EQTALTLLES VFSEERALFG GSPSDLARHV LLQRGLQGHT GLVALDARVN VDVIGLGASA
PSYYPAVGER LRCNMILPEH AGVANAIGAV VGRIVQRRSG TVTAPSEGRF RVHLADGPVD
FTESEAALTA LEYALRAETV AAAEEAGAGD IHVHVTKDLR TAEVEARQVF VEATLTVEAS
GRPRVAHA
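
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated is given below. It is not the database's own method: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 differs in the set of permitted start codons, which this sketch does not handle; that makes no difference here, since the gene starts with ATG.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (first base varies slowest,
# bases ordered T, C, A, G). Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = clean("ATGGCGATAT TGCTTGGGAT CGACACGGGT")
print(translate(fragment))  # MAILLGIDTG
```

The translated fragment matches the first block of the protein sequence listed above. Pasting the full gene sequence into `clean` and passing the result to `gc_percent` and `translate` gives a way to compare against the GC content and protein length reported in the record.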