Gene TM1040_0385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0385 
SymbolureC 
ID4078618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp393082 
End bp394794 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content62% 
IMG OID638005680 
Producturease subunit alpha 
Protein accessionYP_612380 
Protein GI99080226 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCA ACATCCCTCG CTCTGACTAT GCCGCCATGT ATGGGCCGAC CACCGGCGAC 
CGCGTGCGGC TGGCCGATAC TGACCTGATC ATCGAGGTGG AGCGCGACCT CACCGGCCCT
TACGGCGAAG AGGTCAAATT TGGCGGCGGC AAGGTGATCC GCGACGGGAT GGGACAGGCG
CAGACGACGC GCGCGGGCGG CGCGGTCGAT ACCGTGATTA CCAACGCGCT GATCCTTGAT
TGGACCGGCA TCTACAAGGC TGACGTCGGG CTGAAAGACG GGCGCATCCA TGCCATCGGT
AAGGCCGGCA ACCCCGACAC TCAACCCAAT GTGACCATTA TCGTTGGCCC CGGCACCGAG
GTGATCGCGG GCGAAGGGCG CATCCTGACG GCAGGCGGGT TTGACAGCCA TATCCATTAT
ATCTGCCCGC AACAGATCGA GGATGCGCTG CACTCGGGTC TGACCACCAT GCTCGGCGGC
GGCACCGGGC CAGCCCATGG GACTTTGGCC ACCACCTGCA CCCCCGGTGC CTGGCACCTG
GGTCGGATGA TGCAGGCCGC AGATGCGTTT CCGATGAACC TGGCGTTTGC AGGCAAAGGG
AATGCCTCGC TGCCCGCCGC CATTGAAGAA CAGGTTAACG CAGGCGCCTG CGCGCTGAAA
CTGCATGAGG ACTGGGGCAC CACACCCGCT GCCATCGACT GCTGCCTCGG GGTGGCCGAT
GCAATGGATG TGCAGGTGAT GATCCATACA GACACGCTCA ATGAGTCGGG GTTTGTGGAA
CACACCGTCA AGGCCATGAA AGGCCGCACC ATCCACGCCT TTCACACCGA AGGTGCGGGC
GGTGGCCACG CACCGGACAT CATCAAGATC TGCGGTGAGG AGTTCGTGTT GCCCTCCTCG
ACCAACCCGA CCCGCCCCTT CACCGTGAAC ACCATCGAAG AGCACCTCGA CATGCTCATG
GTCTGTCATC ACCTCGACAA ATCCATCCCC GAGGATGTGG CCTTTGCCGA GAGCCGGATC
CGGCGCGAAA CCATTGCCGC CGAGGACATC CTGCACGACA TGGGGGCCTT CTCGATCATC
GCAAGCGACA GCCAGGCGAT GGGACGCGTG GGCGAGGTCA TCATTCGCAC ATGGCAGACC
GCAGACAAGA TGAAGAAACA GCGCGGCCGC CTGAGCGAGG AAACAGGTGA GAACGACAAC
TTTCGCGTGC GGCGCTATGT GGCAAAATAC ACCATCAACC CGGCGATCGC GCATGGGATC
GCGCATGAAA TTGGCTCTAT CGAGGTGGGC AAGCGCGCGG ATCTGGTGCT GTGGAACCCA
GCCTTCTTTG GTGTAAAGCC CGAGATGGTC CTGATGGGCG GCACCATCGC CTGCGCGCAA
ATGGGCGATC CCAACGCCTC CATTCCGACG CCGCAGCCGG TCTATTCGCG TCCCATGTGG
GGCGCTTACG GGCGCTCGGT CGAGCATTCT GCCGTCACCT TTGTGTCCGA GGCCGCGCAG
GCCGCAGGCA TCGGTAAGAC ACTGGGTCTT GCAAAACAGA CACTTGCGGT AAAGGGCACA
CGCGGGATCG GGAAGTCAGC ACTCAAGCTC AACACCGCCA CGCCTGAGAT CGAGGTTCAC
CCTGAAACCT ATGAGGTACG CGCGAATGGG GAGCTTTTGA CCTGTCAGCC CGCCGAGGAA
CTGCCCTTGG CACAGCGATA TTTCCTCTTC TAA
 
Protein sequence
MPANIPRSDY AAMYGPTTGD RVRLADTDLI IEVERDLTGP YGEEVKFGGG KVIRDGMGQA 
QTTRAGGAVD TVITNALILD WTGIYKADVG LKDGRIHAIG KAGNPDTQPN VTIIVGPGTE
VIAGEGRILT AGGFDSHIHY ICPQQIEDAL HSGLTTMLGG GTGPAHGTLA TTCTPGAWHL
GRMMQAADAF PMNLAFAGKG NASLPAAIEE QVNAGACALK LHEDWGTTPA AIDCCLGVAD
AMDVQVMIHT DTLNESGFVE HTVKAMKGRT IHAFHTEGAG GGHAPDIIKI CGEEFVLPSS
TNPTRPFTVN TIEEHLDMLM VCHHLDKSIP EDVAFAESRI RRETIAAEDI LHDMGAFSII
ASDSQAMGRV GEVIIRTWQT ADKMKKQRGR LSEETGENDN FRVRRYVAKY TINPAIAHGI
AHEIGSIEVG KRADLVLWNP AFFGVKPEMV LMGGTIACAQ MGDPNASIPT PQPVYSRPMW
GAYGRSVEHS AVTFVSEAAQ AAGIGKTLGL AKQTLAVKGT RGIGKSALKL NTATPEIEVH
PETYEVRANG ELLTCQPAEE LPLAQRYFLF