Gene Information |
Locus tag | Gura_2590 |
Symbol | |
ID | 5164121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 2997757 |
End bp | 3000627 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640550086 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001231340 |
Protein GI | 148264634 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
| |
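The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2997757
END_BP = 3000627
GENE_LENGTH_BP = 2871
PROTEIN_LENGTH_AA = 956


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", coding_codons(GENE_LENGTH_BP))
```

Under these assumptions, the computed span should agree with the Gene Length field and the codon count with the Protein Length field.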
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCC GGAAGACACG TTCAGCAGGT ATGCAGAATT TATTGTACGT GGCATTCTGC TGCGCTTTGT CATTATTCGC TTTTACGTTG AATGCGAGAG TCGGTCATGC CCAGGAACGC CTGCCTGGTG CGATAACTCA GCCTTTCAAG CTTCATGTCG ACGAAATAAA AAGAGCCAGG GCTTCCTATG CTACACTGCC GGTTATGCCC AGTATTTTGG GTTTTTTTGA AATATATGAA ACGTATGGAA ATGGCAGCAG AAAAAGGCTG TTGAACGGCC CATGGCGGAC CGGCACGCTG CCGGGCCACC TGACCAGGGA TTCGGGCAAA AAATACACAA TTTCCTCCCG GGACGCGGCA ACGGGCGTTT CCTACAGGAT TACCTATAAA CGGGCCGCTG AGGATGAGTT GCTCGTCGAG TTTAAGCTCC AGATGCCTGT GGGAGTTGAA AAACCCGGAA TTGAATTTGA GATATGCAAG CTATCAGGTG ATGTTTTCAA GGGAGCTAAG GTGCAGGCTG TTCCCGGTAT ATCCGGGAAC TCCGGGATTT TGCCTCTGGA GCCGCGTCCT GTGCAAAACC GTTTCCTGTA TAACGATAAA AGTGAAATTC TTGTCAAGGG CAAGGTTTGC GATATACGCA TCAAGGACCT TGCCGGAGGC AATTCCATCA ATATCGCCGA TTTTAGGAAT ATCCCATGGG ATTCAAAAAA GAGTTTTCAT TTTTATGGGG GGAAAAAAGG CCTGACTCCG GGCAAGGAGT ATCAATTCGG GTATTCCATC CGTTTCCTCC CTCCATCTGT GGCGAGGTCC AGCGCAAATG CGCCCCATGT GCCGGATGCT CTCGTGGCAC AGACCCCCGA CAACTTACAG CGGTTTCTTT CTGTAACGCC GAAGGAATAT AAAACTGCCG AAGGCTGCTA TCTGCTTACG CCTGGAGAGT TCATTTTTGC TCCGGTAAAT GATCCAGCTC AACAGATGTT GGTTTCTGAA ATCAAGAGCA TTACCAGGCT TCCCATTTAT GCAAGACCAT TGGACAGGGG CCAGTCTGGG AGAGGTATAT ATATTGAGAA TTTGTCAAAG GCGAATACGC TGTCTCACTC TCTTCCTCAG GAAGGGTTTG AACTGATCAT CAATCCGGAC CGGGTTGTGG TGAGGGGGGC CGATGCTAGG GGCTGTTTGT ACGGGACGTA TGCATTGCTC GGCAGGATCC GGCAGGACAA GGGGGGATGG GGTATACCCT GTGGTACGGT GCGAGATTGG CCGGACCTGC GGACACGAGG AATATGTGTG GAAATGCTGT CTCCGCAGCG AAATGATATT AACCTGTTTA AACGCTATGT GCTTGCGTTT TCACATGCGA GGGCCAATCT GCTGATCTTT CACTTTTATC CACAGCATGT TGTGAAGTGG AACACCGGTA AAGGTCGTAA CGATTGGACA CCGGAACAGA TTGCCGAGGT CGCCGATTAT GCGAGAAGCT TGGGCATGGA AGTTTGGGCA GGCATGGTGG CAAAATTCGA TGCGTCAGCT TTTCCCCAAC TTCCCATGCT GCAATCGGCA AACATCTATA ATCCTCTGGA AGAGCGTTCC TATAACTTTC TGTTTTCTCT GTATGAACGC ATAATTGCTT CAATCAAACC AACGGTAATG TTAATTGGCC ATGATGAGGT TAAAGGTCTT TCCCTCTATG CCGGAAAAGA ACCTGAAAAG ACAGGTAAGC TTTTTGCGGC GGATATAAGG AAGCTCCACG ATTGGCTTGC TTCCCGGGGC GTAGGCACTG CGATGTGGGG AGACATGCTG 
CTGGACGACA GCAGATGGTC TGGCGAAGTC GGTGACGCCA ATAGCAACAA TCCTGTCTAC AATTCAGGCG CGACCCACCT GGCAATCGAC CACATTCCCA AGGATGTGAA GATCCTGGAC TGGCATTACG GAGAGATGCC CGGATACCGC AGCATCGACT ATTTCCGCAA ACACGGCTTC CAGGTATATG GCAGTCCCTG GCATTTCCCC CGGGCGACAA AGGCTCTCGC GAAAAGCGTA AAGGAGTATC AGGGGCAGGG GATGATCGGC ACGGACTGGG GGTTTTGGCG GACTCTATCG TCCTCAGCTA CAACACTGTA TGCTCCACTC TGTGGTTGGA CAAATAATTG TGACATAAGT CAGGATGATG TTGCCGTGAT GGCGGCTAAC CTGAGGGGGA AAGACCCACT CCCGATGAGC ATGTTGAGGC AAGTTCCAGT TGATCTGCAG CCAAACTGCA ACAGGTCAAC ATGGGATGTA TCCGCAGGAT CCGGTAAGGG TATCTTTGGG GTAGGGCCTC AACTGGACCT GCGGGATTTA CGGCCGGGAA ATCAGATCAG GGGTGGTGTA ACTTTTTCTC TTCTGCCTGC CGAAGAAGGA CGCAGGTACA ATTGTGTCGC TGTTATGGGT GGAGGCAATG GTTTAGTAAA CGAAAACCGG ACAAGCCGGA TTGTGGTTAA AGATCAATTG GCTCAGCAGA TAGCGTTTTT GCATACCGCT TTTCTGGAAG AGCCGCAGGT AAATCCGCGC AAGCTTGGGG AATATGTCAT AGAATTTCAA AGCGGCCGCC AGGAAACTGT AAGCTTGACG GAAAACGTGA ATATCACAGA TGTGCGCTCA AGTGAAGGTC TCCGGGACAA CAGCTGGACC TTTACCAGGT CACCAGATGT CCTACTGGAT TCAGTTCCGG GCTGGCGTGG AGTATCCGGT ATTGGTCTCC CATTGAATAT GCAGGTCTTT ATCTGGCGGA ATCCATACCC TGATGAAAAA ATCACAAGCA TTCGACTTCG TGCGACTGAA AAGCAGCCTA AATTGCATTT GGCACTACTG GGAGTGACAT TGCTGCAATG A
|
Protein sequence | MKTRKTRSAG MQNLLYVAFC CALSLFAFTL NARVGHAQER LPGAITQPFK LHVDEIKRAR ASYATLPVMP SILGFFEIYE TYGNGSRKRL LNGPWRTGTL PGHLTRDSGK KYTISSRDAA TGVSYRITYK RAAEDELLVE FKLQMPVGVE KPGIEFEICK LSGDVFKGAK VQAVPGISGN SGILPLEPRP VQNRFLYNDK SEILVKGKVC DIRIKDLAGG NSINIADFRN IPWDSKKSFH FYGGKKGLTP GKEYQFGYSI RFLPPSVARS SANAPHVPDA LVAQTPDNLQ RFLSVTPKEY KTAEGCYLLT PGEFIFAPVN DPAQQMLVSE IKSITRLPIY ARPLDRGQSG RGIYIENLSK ANTLSHSLPQ EGFELIINPD RVVVRGADAR GCLYGTYALL GRIRQDKGGW GIPCGTVRDW PDLRTRGICV EMLSPQRNDI NLFKRYVLAF SHARANLLIF HFYPQHVVKW NTGKGRNDWT PEQIAEVADY ARSLGMEVWA GMVAKFDASA FPQLPMLQSA NIYNPLEERS YNFLFSLYER IIASIKPTVM LIGHDEVKGL SLYAGKEPEK TGKLFAADIR KLHDWLASRG VGTAMWGDML LDDSRWSGEV GDANSNNPVY NSGATHLAID HIPKDVKILD WHYGEMPGYR SIDYFRKHGF QVYGSPWHFP RATKALAKSV KEYQGQGMIG TDWGFWRTLS SSATTLYAPL CGWTNNCDIS QDDVAVMAAN LRGKDPLPMS MLRQVPVDLQ PNCNRSTWDV SAGSGKGIFG VGPQLDLRDL RPGNQIRGGV TFSLLPAEEG RRYNCVAVMG GGNGLVNENR TSRIVVKDQL AQQIAFLHTA FLEEPQVNPR KLGEYVIEFQ SGRQETVSLT ENVNITDVRS SEGLRDNSWT FTRSPDVLLD SVPGWRGVSG IGLPLNMQVF IWRNPYPDEK ITSIRLRATE KQPKLHLALL GVTLLQ
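The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the start of the gene. It is illustrative only: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for sense codons (table 11 differs in its permitted start codons). Only the first two blocks of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same amino-acid assignments for sense codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for ten-character display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    fragment = clean_sequence("ATGAAGACCC GGAAGACACG")
    print(fragment, gc_fraction(fragment), translate(fragment))
```

Applying the same functions to the full gene sequence would allow a comparison against the listed GC content and protein sequence, keeping in mind that the translation of the complete gene ends with a `*` for the stop codon, which the protein field omits.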