Gene TM1040_3255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3255 
Symbol 
ID4075397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp256334 
End bp257305 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content64% 
IMG OID638004764 
ProductUrea carboxylase 
Protein accessionYP_611491 
Protein GI99078233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.166744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.612176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTG AGATTGTGAA GCCGGGACTT GCAACCACTG TGCAGGATCT GGGGCGCCCG 
GGCTATTTCC ATCTTGGAAT CCCCGAGGGC GGTGCGATGG ATCGCCTTGC GCTGCGGGCG
GCGAACATGC TGGTCGGAAA CGAGGATGGC GCGGCCTGTC TCGAGGCGGT CTTTATGGGC
CCCGAGGTGA AGTTTGGCGC GGATATGACC GTCGCTGTGA CCGGGGCTGA GCTGCCGGTG
CTGCTGGACG GCATGCCACG CAACACGTGG TCGAGCATTT CGGTCAAAGC CGGGCAGGTG
CTCTCGTTTG GGTTCCTCAA AGAGGGGGCG CGGATCTATA TCGCCGTCTC AGGCGGGATC
GACACGCCGC CTGCGTTGGG GTCTCGTTCC ACCTACGCGA TCGGCGCGCT TGGCGGCTTT
GAGGGCCGTC CCGTTGCGGC CGGAGATGTG ATCCCGCTGG GGCGGGGCGC GGGGATGCCG
GAGGGGCGCA TAGTGCCAGA CGCGCTGCGC CGCCGCCCTG CAAAACCCGC CGCCTTGCGC
GTGCTTCCCG GCCTCTACTG GCACCGTTTG ACCGAAAAAA GCCAGGCAGC GTTTTTTGAG
GATGACTGGA CCGTCGCCCC GGAAGCAGAT CGCATGGGCT ACCGGTTCAG AGGCGGGCAG
GCGATGGAAT TTGTTGATCG TGATCAACCG TTTGGAGCCG GATCAGACCC GTCCAACATC
GTCGATGGCT GCTATTCCTA TGGCTCCATC CAGGTGCCCG GAGGGCTCGA GCCCATCGTT
TTGCACCGCG ACGCGGTCTC GGGCGGGGGG TATTTCACCC TCGGGGCCGT GATCTCGGCG
GATATGGACC TGATCGGCCA ACTGCAGCCC AATACGCCGG TCAAATTCAT GCGCGTTGAT
ATGGATCAGG CGCTTGCCGC TCGAAAGGCC CGCAAAGAGA CCATCGAGCA GATCCGTCAG
GCGCTCTCCT AG
 
Protein sequence
MTLEIVKPGL ATTVQDLGRP GYFHLGIPEG GAMDRLALRA ANMLVGNEDG AACLEAVFMG 
PEVKFGADMT VAVTGAELPV LLDGMPRNTW SSISVKAGQV LSFGFLKEGA RIYIAVSGGI
DTPPALGSRS TYAIGALGGF EGRPVAAGDV IPLGRGAGMP EGRIVPDALR RRPAKPAALR
VLPGLYWHRL TEKSQAAFFE DDWTVAPEAD RMGYRFRGGQ AMEFVDRDQP FGAGSDPSNI
VDGCYSYGSI QVPGGLEPIV LHRDAVSGGG YFTLGAVISA DMDLIGQLQP NTPVKFMRVD
MDQALAARKA RKETIEQIRQ ALS