Gene TM1040_1498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1498 
Symbol 
ID4077054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1603637 
End bp1604887 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID638006811 
Productallantoate amidohydrolase 
Protein accessionYP_613493 
Protein GI99081339 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC TTGGACAGAA TCTGAAAATC AATGGCGATC GGCTGTGGGA CAGTCTGATG 
GATATGGCCA AGATTGGCCC CGGTGTGGCC GGTGGCAACA ATCGCCAGAC GCTCACCGAT
GAGGACGCCG AAGGCCGCGC CCTGTTTCAG TCGTGGTGTG AAGCGGCGGG CATGACCATG
GGGCTCGACT CCATGGGCAA TATGTTTGCA ACCCGCCCGG GTGAAGATCC CGAGGCATTG
CCGGTCTACA TGGGATCGCA TCTCGACACC CAGCCGACGG GCGGCAAATA CGATGGCGTT
CTAGGGGTGC TTGGCGGTCT TGAGGTCGTG CGGACCATGA ATGACCTCGG CATCAAGACC
AAGCACCCGA TCGTGGTGAC CAACTGGACC AATGAGGAAG GCACGCGGTT TGCTCCGGCT
ATGCTGGCGT CGGGCGTGTT TGCCGGCAAA CATACGCAAG ACTGGGCCTA TGGGCGCGAG
GATGCTGAAG GCAAGACCTT TGGCGACGAG CTGAAGCGCA TCGGCTGGGT TGGCGATGAA
GAGGTCGGCG CCCGCAAGAT GCACGCCATG TTTGAGCTGC ACATCGAGCA GGGTCCGATT
CTTGAGGCTG AGAAGAAAGA CATTGGCGTG GTGACACACG GTCAGGGGCT CTGGTGGTTG
CAATGTACCG TGACCGGCAA GGATGCGCAC ACTGGCTCGA CCCCGATGAA TATGCGGGTG
AACGCCGGGC TCGGCATGGC GCGGATGACG GAAGCGGCGC ATCAGATCGC CATGGCGCAT
CAGCCGCATG CAGTGGGCGC AGTGGGGCAT TGCGATGTCT TCCCCAACTC GCGCAATGTG
ATCCCGGGCA AGGTGGTGTT CACCGTGGAT TTCCGCTCTC CCGACCTTGA AAAGCTGACA
TCGATGCGCA CGCAATACGA AGCCAAAGCA AAGGAAATCG CGGCGGAGCT CGGTCTCGGT
CTGGAGATCG AGCCGGTGGG GCATTTCGAC CCGGTGACCT TTGATGAGAG CTGCGTGAGT
GCGGTTCGGG GCGCGGCAGA GCGTTTGGGC TATAGCCACA TGGATATCGT CTCTGGCGCG
GGGCATGATG CCTGCTGGAT CAATGATGTG GCACCCACCG CGATGATCAT GTGTCCTTGC
GTGGACGGTC TGAGCCATAA TGAGGCGGAA GAGATTTCGA AGGACTGGGC CGCGGCCGGC
ACGGATGTGA TGCTGCATGC GGTGCTTGAG ACGGCTGAGA TCGTCGCCTG A
 
Protein sequence
MTALGQNLKI NGDRLWDSLM DMAKIGPGVA GGNNRQTLTD EDAEGRALFQ SWCEAAGMTM 
GLDSMGNMFA TRPGEDPEAL PVYMGSHLDT QPTGGKYDGV LGVLGGLEVV RTMNDLGIKT
KHPIVVTNWT NEEGTRFAPA MLASGVFAGK HTQDWAYGRE DAEGKTFGDE LKRIGWVGDE
EVGARKMHAM FELHIEQGPI LEAEKKDIGV VTHGQGLWWL QCTVTGKDAH TGSTPMNMRV
NAGLGMARMT EAAHQIAMAH QPHAVGAVGH CDVFPNSRNV IPGKVVFTVD FRSPDLEKLT
SMRTQYEAKA KEIAAELGLG LEIEPVGHFD PVTFDESCVS AVRGAAERLG YSHMDIVSGA
GHDACWINDV APTAMIMCPC VDGLSHNEAE EISKDWAAAG TDVMLHAVLE TAEIVA