Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_0719 |
Symbol | |
ID | 6369441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | - |
Start bp | 736906 |
End bp | 738084 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642676111 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001950965 |
Protein GI | 189423788 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.121181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGT CCGACCAGTC ATCATCAGGC AGCACACCGC TGCCGCCCTG CAGCCGCCGC ACCTTTCTGA AAGGTGCCGC CCTGGCCGGT ACGGTGGCCG CCTTCCCGTC CCTGATGGCC GGCTGCGGCA GTTCATCAGG GGATGTTCCA TTGGAACAGA AGATAGCCCA GATGCTGATG GTGGGCTTCC GCGGTCTGAC CCTGGATGAC AGCAACTACA TTGTCCGGGA CATCCGCGAC TACCGTATCG GCGGAACCAT CCTCTTTGAC CGGGATGCAA AGCTTAAGAC CTACGGGCGC AACATCGTTT CGCCGGAGCA GTTGCAAGGG CTGACGGCCC AGCTGCGCAC ACTTTCCGCC ACCCCGCTCT TCATTGCCGT TGATCAGGAG GGGGGGAAAA TCGTGCGACT GAAGGAGAGC TACGGCTTTC CCCCCACGGT CTCGGCCCAG CAGCTCGGCA CCCTCAATAA CCCGGCGGTT ACCCGGCTGT ATGCCGACAG TATCGGTGCC ACTCTGGCAA CAAACGGGTT GAACATGAAC TTTGCACCGG TGGTTGACCT GAACATCAAC CCCCAAAGCC CGGCCATCGG CGCACTGGAG CGCAGTTTTT CCGCAGATCC GTCCATTGTC ACCAACCATG CCCGGATCTT TGTGGAGACC CACGACGCAA ACCGGGTCGC CACCTGTTTC AAACATTTCC CCGGCCACGG CAGCGCCACC GCCGACTCCC ATCTGGGGTT TGTGGATGTG ACCGATACCT GGTCGGCTGT TGAACTTGAG CCGTACCGGA ATCTGATCAA CGCAGACAAG GCCAGGATGG TGATGACCGC CCATGTCTTT AACCGGCATA TTGATCCTGA TCTGCCCGCA ACCCTCTCAC AACCGTTCAT CACCGGCATA CTGCGTGAAC AGTTGGGGTT TAACGGGGTG GTGGTAACCG ATGACCTGCA GATGCAGGGG CTGACCCAGT TCTTTGATTA CAAGACCATT GTCGAAAAAA GTATCCTGGC CGGGGTGGAT ATTATCCTGG TTTCCAACAA CCTGGAGTAT GATCCCGAGA TCACCCCCAC CACCATTAAT CATGTGGTTG ATCTGGTGAA CAGCGGCAGG ATATCCGAAC AACGGATTGA TCAGTCCTAC CGGCGTATCA TGGCACTGAA GGGACGCCTG TTTGCCTGA
|
Protein sequence | MEQSDQSSSG STPLPPCSRR TFLKGAALAG TVAAFPSLMA GCGSSSGDVP LEQKIAQMLM VGFRGLTLDD SNYIVRDIRD YRIGGTILFD RDAKLKTYGR NIVSPEQLQG LTAQLRTLSA TPLFIAVDQE GGKIVRLKES YGFPPTVSAQ QLGTLNNPAV TRLYADSIGA TLATNGLNMN FAPVVDLNIN PQSPAIGALE RSFSADPSIV TNHARIFVET HDANRVATCF KHFPGHGSAT ADSHLGFVDV TDTWSAVELE PYRNLINADK ARMVMTAHVF NRHIDPDLPA TLSQPFITGI LREQLGFNGV VVTDDLQMQG LTQFFDYKTI VEKSILAGVD IILVSNNLEY DPEITPTTIN HVVDLVNSGR ISEQRIDQSY RRIMALKGRL FA
|
| |