Gene Bind_1041 details


Gene Information

Locus tag: Bind_1041
Symbol: ureC
ID: 6199952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1193551
End bp: 1195263
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 58%
IMG OID: 641705033
Product: urease subunit alpha
Protein accession: YP_001832173
Protein GI: 182678027
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
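
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1193551, 1195263)
print(length)                  # 1713
print(protein_length(length))  # 570
```

Both values agree with the Gene Length and Protein Length fields above.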


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.497339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA CCCTACCCCG CCCCGCTTAT GCCGGCATGT TTGGACCGAC CACGGGCGAC 
AAGGTTCGCC TTGCCGATAC GGAGCTTTTC ATCGAGATCG AACGCGATTT CACCCTTTAT
GGCGAGGAAG TGAAATTTGG CGGCGGCAAG GTCATTCGCG ACGGGATGGG ACAAGGCCAG
GCCTCAAAAG CCGAGGGCGC GGCGGATACA ATTATCACCA ATGCCGTGAT CATCGACCAT
TGGGGCATTG TCAAAGCCGA TGTCGGGCTG CGTGATGGGC GCATTATCGG CATTGGCAAG
GCGGGCAATC CCGATGTCCA GCCGGGCATC GATCTCATCA TCGGCCCTGG CACCGAAATC
ATTGCCGGTG AGGGACGCAT TCTCACCGCT GGCGGGTTCG ACAGCCATAT TCATTTCATC
TGCCCTCAAC AAATTGAAGA GGCCTTGGCC TCGGGCATGA CGACAATGCT CGGCGGGGGC
ACCGGCCCAG CGACAGGCAC TTTCGCGACG ACCTGCACGC CAGGACCCTG GCATATTGCC
CGGATGATCG AAGCCTCTGA CGGTTTCGCC ATGAACCTCG GTTTTGCCGG CAAGGGCAAT
GCCTCCAGAT CTGAAGGTCT CGTCGAGCAG ATCGAGGCGG GCGCTTGCGC CCTGAAACTG
CATGAGGATT GGGGCACGAC ACCAGCGGCC ATCGACTGCT GCCTGTCCGT CGCTGATGAT
CACGATATAC AGGTCATGAT CCACACGGAT ACATTGAACG AATCCGGTTT CGTCGAGGAC
ACGATCCGCG CCTTCAAGGG GCGCACCATT CATGCCTTCC ACACCGAAGG CGCCGGAGGC
GGCCATGCGC CCGACATTAT GAAAGTCGCG GGCCTGCCCA ATGTCCTGCC TTCCTCGACC
AATCCGACAC GGCCCTTCAC CGTCAATACG CTCGACGAAC ATCTCGACAT GCTGATGGTT
TGCCATCATC TCGATCCCTC CATTGCGGAG GATCTCGCCT TCGCCGAAAG CCGTATCCGC
AAGGAAACCA TTGCGGCTGA GGATATTCTG CACGACCTTG GTGCTTTATC GATGATGTCC
TCGGATAGTC AGGCCATGGG ACGCATCGGC GAGGTGATCA CACGCACCTG GCAGACAGCC
GATAAGATGA AGCGTCAGCG CGGACCACTC CCCGAAGACA AGAGCAATAA CGACAATTTC
CGTGTGCGCC GTTACATTGC CAAATACACA ATCAATCCGG CCATCACTCA TGGCGTTTCG
CGTCACATCG GCTCGATCGA GCCCGGCAAG CTCGCCGATC TTGTTTTATG GACGCCTGCT
TTTTTCGGCG TGAAGCCGGA TCTCGTCATC AAGGGCGGTA TGATCGCCTA TGCGATGATG
GGCGATCCCA ACGCCTCGAT CCCGACACCG CAACCCGTGC ATGGGCGCCC AATGTTCGGA
AGTTTTGGCG GGGCACGGAC CGGCACGTCC TTAACTTTTA CGTCGAAGAC GGCCCTGGCG
CATGGCCTGG CCCAAAAGCT CAAAATTTCG CGTAAATTAG TACCCGTCGA AAACACCCGC
GGAAATTTGC GCAAGACGAG CCTGATCCTC AACGGCGCGA TGCCTCACAT CGAGATCGAT
CCGGAAACCT ATGTGGTCAA GGCTGATGGC ATGGTACTGA CCTGCGAGCC AGCGAGGAGC
CTGCCCATGG CGCAGCGCTA TTTTCTGTTC TGA
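
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be removed before the length or GC content can be recomputed. A minimal sketch follows; the helper names are hypothetical, and only the first three blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, as printed in this record.
sample = "ATGGCCGTCA CCCTACCCCG CCCCGCTTAT"
seq = clean_sequence(sample)
print(len(seq))  # 30
print(f"{gc_content(seq):.1%}")
```

Applied to the full sequence, the same two functions can be used to check the Gene Length and GC content fields of the record.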
 
Protein sequence
MAVTLPRPAY AGMFGPTTGD KVRLADTELF IEIERDFTLY GEEVKFGGGK VIRDGMGQGQ 
ASKAEGAADT IITNAVIIDH WGIVKADVGL RDGRIIGIGK AGNPDVQPGI DLIIGPGTEI
IAGEGRILTA GGFDSHIHFI CPQQIEEALA SGMTTMLGGG TGPATGTFAT TCTPGPWHIA
RMIEASDGFA MNLGFAGKGN ASRSEGLVEQ IEAGACALKL HEDWGTTPAA IDCCLSVADD
HDIQVMIHTD TLNESGFVED TIRAFKGRTI HAFHTEGAGG GHAPDIMKVA GLPNVLPSST
NPTRPFTVNT LDEHLDMLMV CHHLDPSIAE DLAFAESRIR KETIAAEDIL HDLGALSMMS
SDSQAMGRIG EVITRTWQTA DKMKRQRGPL PEDKSNNDNF RVRRYIAKYT INPAITHGVS
RHIGSIEPGK LADLVLWTPA FFGVKPDLVI KGGMIAYAMM GDPNASIPTP QPVHGRPMFG
SFGGARTGTS LTFTSKTALA HGLAQKLKIS RKLVPVENTR GNLRKTSLIL NGAMPHIEID
PETYVVKADG MVLTCEPARS LPMAQRYFLF
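
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes the standard amino-acid assignments, which translation table 11 shares with the standard genetic code (the two differ in which codons may act as start codons); it does not handle alternative start codons or ambiguous bases. The names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGGCCGTCA CCCTACCCCG CCCCGCTTAT"))  # MAVTLPRPAY
```

The output matches the first 10 residues of the protein sequence above.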