Gene TM1040_2056 details


Gene Information

Locus tag: TM1040_2056
Symbol:
ID: 4077983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2159766
End bp: 2160791
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 65%
IMG OID: 638007375
Product: allophanate hydrolase subunit 2
Protein accession: YP_614050
Protein GI: 99081896
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
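
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2159766
end_bp = 2160791

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon, which is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1026 bp) and Protein Length (341 aa).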


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTGC GCCTTATTGT TCACCGCGCA GGACCGGGTC TGAGCATTCA GGATCTTGGC 
CGTAGCGGCT ATCTCGCCTT TGGTCTGTCG CGCGGTGGCG CGGCGGATCG GCTGGCGCTT
TATGAGGGCG CGGCGCTCTT GGGACAGGAA CCAAGCGCGG CGGCCATCGA GATGGCCGGG
CTTGGCGGCA CATTTGAGGT CACAACCGAT ACCCGTATCG CCCTCACAGG TGCGCCGATG
AAGGCCACGC TACAGGACGG CTCGGAATTG CGCTGGAATG CCAGCCACCT TCTTGCGGCC
GGGATGCAGC TCAGCATTGG CGCAGTGCGG GCGGGGTCGT ATGGCTACCT TCATGTGGGC
GGCGGCATCG CAAACGCGCT GCAGCTTGGC GCGCGCAGTG CGCATCTCGC CTCGGGGCTT
GGCGCACGTC TCCGGGATGG GGCGGAGCTG CCCCTTGGTG ACGATGCAGG GGGTGCAGTC
AATATGACCC TGACCCCCGA GCCGCGCCTG GACGGCGGCA CGCTGCGCAT GGTGCCAAGC
CTGCAAACCA GCCTTTTTGG CGCGGCAGAG GTGGCCCGCT TTCAAGAGGT ACGCTTTCAC
CGTGACAGCC GCGCCAATCG CATGGGGGTG CGGCTCTTGC CGGAGGGGCA GGGGTTTGCG
CTTGAGGGGG GCTTGAGCGT TCTTTCCGAG GTGATCGCAC CCGGTGACAT TCAGGTCACC
GGCGATGGCA CGCCCTATGT TTTGATGAGC GAATGCCAGA CCACCGGCGG CTATCCCCGC
ATCGGCTCTG TTCTGCCTTG CGATATGCCG CGCGTGGCAC AGGCACAGGC AGGAGCGGCG
TTTCGCTTTG AACAGGTGAC ACTTGAGGAA GCGGTCGAGA TTGAACGGCG GGCCCGCGCC
GAGCGCGAGC GTCTGCCCTC CCGGCTGACG CCGCTTGTGC GTGATCCGGC CCGAATGCGG
GATCTTCTGT CCTATCAACT GGTGAGCGGC GTGACCGCCG GGCGCGATCT TGATGAGGCG
CTCTGA
 
Protein sequence
MSVRLIVHRA GPGLSIQDLG RSGYLAFGLS RGGAADRLAL YEGAALLGQE PSAAAIEMAG 
LGGTFEVTTD TRIALTGAPM KATLQDGSEL RWNASHLLAA GMQLSIGAVR AGSYGYLHVG
GGIANALQLG ARSAHLASGL GARLRDGAEL PLGDDAGGAV NMTLTPEPRL DGGTLRMVPS
LQTSLFGAAE VARFQEVRFH RDSRANRMGV RLLPEGQGFA LEGGLSVLSE VIAPGDIQVT
GDGTPYVLMS ECQTTGGYPR IGSVLPCDMP RVAQAQAGAA FRFEQVTLEE AVEIERRARA
ERERLPSRLT PLVRDPARMR DLLSYQLVSG VTAGRDLDEA L
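
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, checked for GC content, and translated. The function names are hypothetical, the codon table is written out by hand using the amino-acid assignments of NCBI translation table 11 (which match the standard code), and the start codon is simply read as its ordinary amino acid. Input is assumed to contain only A, C, G and T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGAGCGTGC GCCTTATTGT TCACCGCGCA GGACCGGGTC TGAGCATTCA GGATCTTGGC"
print(translate(first_row))
print(round(gc_fraction(first_row), 2))
```

Applied to the first row of the gene sequence, `translate` should reproduce the first two blocks of the protein sequence (MSVRLIVHRA GPGLSIQDLG). Passing the full gene sequence to `gc_fraction` gives a value to compare against the listed GC content.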