Gene TM1040_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2135 
Symbol 
ID4076449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2241583 
End bp2242659 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content67% 
IMG OID638007455 
Product5-amino-6-(5-phosphoribosylamino)uracil reductase / diaminohydroxyphosphoribosylaminopyrimidine deaminase 
Protein accessionYP_614129 
Protein GI99081975 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.885222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTCG CCCTGTCGCT CGGGCGGCGA GGGCAGGGTC GGACTTGGCC AAACCCGGCG 
GTCGGCTGTG TAATTGTCCA AAAGGGTCGC GTGGTGGGCC GGGGCTGGAC CCAGCCCGGA
GGCCGTCCCC ACGCCGAACC CATGGCGCTG GCGCAGGCGG GAGCCGCAGC GCGCGGCGCC
ACTGCCTATG TGAGCCTCGA ACCCTGTGCC CATCACGGCA AAACCCCCCC CTGCGCGCAA
GCGCTGATCG AGGCCGGTGT TGCCCGTGTC GTCGCCGCCA TCGAAGACAG CGACCCGCGT
GTCAGCGGTC AGGGCTTTGA GATGCTGCGC GCGGCGGGGA TTTCCGTTAC CACCGGAGTG
CGCGCCGAGG AAGCTGGCTT TGATCACGAA GGGTTCTTTC TAAAAACGGA ACAGGGCCGC
CCTTTTGTGA CGCTGAAACT CGCCGCGAGT TTTGATGGTC GTATTGCCAC CGGCTCCGGT
CAAAGCCAGT GGATCACCGG GCCGGAGGCG CGCCGTGTGG TGCATGCGAT GCGTGCGCGT
CACGATGCTG TCATGGTCGG GGCAGGGACG GCACGCGCGG ATGACCCTTC GCTCACCGTG
CGCGATCTGG GGATTGACCA GCAGCCGGCG CGGGTGGTGG TCTCGCGCCA TCTTGACCTG
CCGCTCATCA GCAAGCTTGC GCGCAGCGCA GCGGAGGTCC CGCTCTATCT TTGCCATGGC
ACAGGCGCGG ATACCGAACG TCTGCGGGCC TGGGACGGGC TGGGAGCGCA TCTGTTGCCG
TGCAACGCTC TTGGCACCCA GCTTGACCCG CATGATGTGC TGCAGCAACT GGGCAGCGTA
GGACTCACAC GCGTGTTCTG CGAAGGAGGA GGCGCGCTGG CGGCCAGCCT GCTGGCGCAT
GACCTCGTGG ATGAGTTGGT GGGCTTCAGT GCTGGTCTGA CGATCGGTGC CGAAGGGCTG
CCCTCCATCG GGGCGCTTGG CATTGGCCAC CTTTCAGAGG CCCCAAGGTT CGACCTTCAT
GAGACACGCC CGATTGGCGC CGACATCCTG CACCGCTGGC GTCGCCCTCA GAACTGA
 
Protein sequence
MGLALSLGRR GQGRTWPNPA VGCVIVQKGR VVGRGWTQPG GRPHAEPMAL AQAGAAARGA 
TAYVSLEPCA HHGKTPPCAQ ALIEAGVARV VAAIEDSDPR VSGQGFEMLR AAGISVTTGV
RAEEAGFDHE GFFLKTEQGR PFVTLKLAAS FDGRIATGSG QSQWITGPEA RRVVHAMRAR
HDAVMVGAGT ARADDPSLTV RDLGIDQQPA RVVVSRHLDL PLISKLARSA AEVPLYLCHG
TGADTERLRA WDGLGAHLLP CNALGTQLDP HDVLQQLGSV GLTRVFCEGG GALAASLLAH
DLVDELVGFS AGLTIGAEGL PSIGALGIGH LSEAPRFDLH ETRPIGADIL HRWRRPQN