Gene Gura_2590 details


Gene Information

Locus tag: Gura_2590
Symbol:
ID: 5164121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 2997757
End bp: 3000627
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 50%
IMG OID: 640550086
Product: glycoside hydrolase family protein
Protein accession: YP_001231340
Protein GI: 148264634
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
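
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2997757
end_bp = 3000627
gene_length_bp = 2871
protein_length_aa = 956

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions the three checks are consistent with the listed values: 2871 bp corresponds to 957 codons, one of which is the stop codon.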


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACCC GGAAGACACG TTCAGCAGGT ATGCAGAATT TATTGTACGT GGCATTCTGC 
TGCGCTTTGT CATTATTCGC TTTTACGTTG AATGCGAGAG TCGGTCATGC CCAGGAACGC
CTGCCTGGTG CGATAACTCA GCCTTTCAAG CTTCATGTCG ACGAAATAAA AAGAGCCAGG
GCTTCCTATG CTACACTGCC GGTTATGCCC AGTATTTTGG GTTTTTTTGA AATATATGAA
ACGTATGGAA ATGGCAGCAG AAAAAGGCTG TTGAACGGCC CATGGCGGAC CGGCACGCTG
CCGGGCCACC TGACCAGGGA TTCGGGCAAA AAATACACAA TTTCCTCCCG GGACGCGGCA
ACGGGCGTTT CCTACAGGAT TACCTATAAA CGGGCCGCTG AGGATGAGTT GCTCGTCGAG
TTTAAGCTCC AGATGCCTGT GGGAGTTGAA AAACCCGGAA TTGAATTTGA GATATGCAAG
CTATCAGGTG ATGTTTTCAA GGGAGCTAAG GTGCAGGCTG TTCCCGGTAT ATCCGGGAAC
TCCGGGATTT TGCCTCTGGA GCCGCGTCCT GTGCAAAACC GTTTCCTGTA TAACGATAAA
AGTGAAATTC TTGTCAAGGG CAAGGTTTGC GATATACGCA TCAAGGACCT TGCCGGAGGC
AATTCCATCA ATATCGCCGA TTTTAGGAAT ATCCCATGGG ATTCAAAAAA GAGTTTTCAT
TTTTATGGGG GGAAAAAAGG CCTGACTCCG GGCAAGGAGT ATCAATTCGG GTATTCCATC
CGTTTCCTCC CTCCATCTGT GGCGAGGTCC AGCGCAAATG CGCCCCATGT GCCGGATGCT
CTCGTGGCAC AGACCCCCGA CAACTTACAG CGGTTTCTTT CTGTAACGCC GAAGGAATAT
AAAACTGCCG AAGGCTGCTA TCTGCTTACG CCTGGAGAGT TCATTTTTGC TCCGGTAAAT
GATCCAGCTC AACAGATGTT GGTTTCTGAA ATCAAGAGCA TTACCAGGCT TCCCATTTAT
GCAAGACCAT TGGACAGGGG CCAGTCTGGG AGAGGTATAT ATATTGAGAA TTTGTCAAAG
GCGAATACGC TGTCTCACTC TCTTCCTCAG GAAGGGTTTG AACTGATCAT CAATCCGGAC
CGGGTTGTGG TGAGGGGGGC CGATGCTAGG GGCTGTTTGT ACGGGACGTA TGCATTGCTC
GGCAGGATCC GGCAGGACAA GGGGGGATGG GGTATACCCT GTGGTACGGT GCGAGATTGG
CCGGACCTGC GGACACGAGG AATATGTGTG GAAATGCTGT CTCCGCAGCG AAATGATATT
AACCTGTTTA AACGCTATGT GCTTGCGTTT TCACATGCGA GGGCCAATCT GCTGATCTTT
CACTTTTATC CACAGCATGT TGTGAAGTGG AACACCGGTA AAGGTCGTAA CGATTGGACA
CCGGAACAGA TTGCCGAGGT CGCCGATTAT GCGAGAAGCT TGGGCATGGA AGTTTGGGCA
GGCATGGTGG CAAAATTCGA TGCGTCAGCT TTTCCCCAAC TTCCCATGCT GCAATCGGCA
AACATCTATA ATCCTCTGGA AGAGCGTTCC TATAACTTTC TGTTTTCTCT GTATGAACGC
ATAATTGCTT CAATCAAACC AACGGTAATG TTAATTGGCC ATGATGAGGT TAAAGGTCTT
TCCCTCTATG CCGGAAAAGA ACCTGAAAAG ACAGGTAAGC TTTTTGCGGC GGATATAAGG
AAGCTCCACG ATTGGCTTGC TTCCCGGGGC GTAGGCACTG CGATGTGGGG AGACATGCTG
CTGGACGACA GCAGATGGTC TGGCGAAGTC GGTGACGCCA ATAGCAACAA TCCTGTCTAC
AATTCAGGCG CGACCCACCT GGCAATCGAC CACATTCCCA AGGATGTGAA GATCCTGGAC
TGGCATTACG GAGAGATGCC CGGATACCGC AGCATCGACT ATTTCCGCAA ACACGGCTTC
CAGGTATATG GCAGTCCCTG GCATTTCCCC CGGGCGACAA AGGCTCTCGC GAAAAGCGTA
AAGGAGTATC AGGGGCAGGG GATGATCGGC ACGGACTGGG GGTTTTGGCG GACTCTATCG
TCCTCAGCTA CAACACTGTA TGCTCCACTC TGTGGTTGGA CAAATAATTG TGACATAAGT
CAGGATGATG TTGCCGTGAT GGCGGCTAAC CTGAGGGGGA AAGACCCACT CCCGATGAGC
ATGTTGAGGC AAGTTCCAGT TGATCTGCAG CCAAACTGCA ACAGGTCAAC ATGGGATGTA
TCCGCAGGAT CCGGTAAGGG TATCTTTGGG GTAGGGCCTC AACTGGACCT GCGGGATTTA
CGGCCGGGAA ATCAGATCAG GGGTGGTGTA ACTTTTTCTC TTCTGCCTGC CGAAGAAGGA
CGCAGGTACA ATTGTGTCGC TGTTATGGGT GGAGGCAATG GTTTAGTAAA CGAAAACCGG
ACAAGCCGGA TTGTGGTTAA AGATCAATTG GCTCAGCAGA TAGCGTTTTT GCATACCGCT
TTTCTGGAAG AGCCGCAGGT AAATCCGCGC AAGCTTGGGG AATATGTCAT AGAATTTCAA
AGCGGCCGCC AGGAAACTGT AAGCTTGACG GAAAACGTGA ATATCACAGA TGTGCGCTCA
AGTGAAGGTC TCCGGGACAA CAGCTGGACC TTTACCAGGT CACCAGATGT CCTACTGGAT
TCAGTTCCGG GCTGGCGTGG AGTATCCGGT ATTGGTCTCC CATTGAATAT GCAGGTCTTT
ATCTGGCGGA ATCCATACCC TGATGAAAAA ATCACAAGCA TTCGACTTCG TGCGACTGAA
AAGCAGCCTA AATTGCATTT GGCACTACTG GGAGTGACAT TGCTGCAATG A
 
Protein sequence
MKTRKTRSAG MQNLLYVAFC CALSLFAFTL NARVGHAQER LPGAITQPFK LHVDEIKRAR 
ASYATLPVMP SILGFFEIYE TYGNGSRKRL LNGPWRTGTL PGHLTRDSGK KYTISSRDAA
TGVSYRITYK RAAEDELLVE FKLQMPVGVE KPGIEFEICK LSGDVFKGAK VQAVPGISGN
SGILPLEPRP VQNRFLYNDK SEILVKGKVC DIRIKDLAGG NSINIADFRN IPWDSKKSFH
FYGGKKGLTP GKEYQFGYSI RFLPPSVARS SANAPHVPDA LVAQTPDNLQ RFLSVTPKEY
KTAEGCYLLT PGEFIFAPVN DPAQQMLVSE IKSITRLPIY ARPLDRGQSG RGIYIENLSK
ANTLSHSLPQ EGFELIINPD RVVVRGADAR GCLYGTYALL GRIRQDKGGW GIPCGTVRDW
PDLRTRGICV EMLSPQRNDI NLFKRYVLAF SHARANLLIF HFYPQHVVKW NTGKGRNDWT
PEQIAEVADY ARSLGMEVWA GMVAKFDASA FPQLPMLQSA NIYNPLEERS YNFLFSLYER
IIASIKPTVM LIGHDEVKGL SLYAGKEPEK TGKLFAADIR KLHDWLASRG VGTAMWGDML
LDDSRWSGEV GDANSNNPVY NSGATHLAID HIPKDVKILD WHYGEMPGYR SIDYFRKHGF
QVYGSPWHFP RATKALAKSV KEYQGQGMIG TDWGFWRTLS SSATTLYAPL CGWTNNCDIS
QDDVAVMAAN LRGKDPLPMS MLRQVPVDLQ PNCNRSTWDV SAGSGKGIFG VGPQLDLRDL
RPGNQIRGGV TFSLLPAEEG RRYNCVAVMG GGNGLVNENR TSRIVVKDQL AQQIAFLHTA
FLEEPQVNPR KLGEYVIEFQ SGRQETVSLT ENVNITDVRS SEGLRDNSWT FTRSPDVLLD
SVPGWRGVSG IGLPLNMQVF IWRNPYPDEK ITSIRLRATE KQPKLHLALL GVTLLQ
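
Both sequences above are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction for a DNA string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record, used as a short demonstration.
first_line = "ATGAAGACCC GGAAGACACG TTCAGCAGGT ATGCAGAATT TATTGTACGT GGCATTCTGC"
dna = clean_sequence(first_line)

print(len(dna))                    # six blocks of ten bases
print(dna.startswith("ATG"))       # the fragment begins with the ATG start codon
print(round(gc_fraction(dna), 2))  # GC fraction of this fragment only
```

Applying the same helpers to the complete gene sequence would allow its length and GC content to be compared with the Gene Length and GC content fields listed under Gene Information.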