Gene Information |
Locus tag | Gura_3799 |
Symbol | |
ID | 5166158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4438486 |
End bp | 4439646 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640551282 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001232523 |
Protein GI | 148265817 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
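The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4438486
end_bp = 4439646

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon that is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.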
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATAG GGATCAGCGC CCTGAATTTT TCTCCCGGCG AGATGGGGGG GCAGGAGACA TATTTCAGGA ATCTCGTTCA CCATCTGCAG CGGGTGGACA GGGAAAACAG CTATGCCCTG TTGTGCGATG CACGCAAGGT CCGCGAGTTC CCCCTCTCCA ACGACTCGTT CAGGGTTACG CTCTGCAATT ACGACAAACC CTCGCTCAAC TGGCTCATCC GTGGCATGCT GAAAAAGATG ATCCATCTGG ATCTGGTAAA CCTGCGACTG AAAGGGCTAA AGCTCGATGT CATTCATCAT CCGTTCACTG TCCTGAACCC CCAGTGGTCC CAGATACCCT CAGTGTTGAC CTTTCTTGAC ATGCAGCAGG AATATTTCCC GCAGTTTTTC TCAAAACTCG AATTGGCCAT CCGTAAACAG ATATCCCGCC CTTCTGCTGA AAAGGCGACC AGGATCATCG CCATTTCCCG GCATGTTAAG GACTGTCTTG TGGAAAAATA CGGGATTGAT GCAGGGAAAA TTGACGTGAT TTATCCCGGT TGCGGCGCCG AATTCCGGGT AATTGACGAT GCCGTGGGGC TGGCGGAGCT GAAGCTCCGC TACGGCCTGG AAAGGCCGTT TGCCTATTAT CCGGCGGCGA GCTGGCCCCA TAAGAATCAC AAGACACTCC TGGCGGCCTG GAAGATTTTG CAGGAGAGGC GCGGCTTTGA CGGCCAGCTC GTCCTTACCG GCATTGCCAA ACAGGCGCAC GGCGAAATCC AGGGGGAGGT CGGCAGGCTC GGTCTTGATG CTACGGTGAA GGTCTTGGGC TATCTACCTT CCGATGAACT CCCGTACCTT TACAACCTTG CCCGGCTGAT GGTTTTCCCT TCGCTCTTCG AGGGGTTCGG CATTCCGCTG GTGGAGGCCA TGGCCTGCGG TTGTCCGCTA GTATGCTCCA CGGCGACATC CGTTCCCGAG GTGGCTGGTG ATGCCGGGAT TCAGTTCGAT CCCCTTTCTC CGGAGGATAT GGCCGACAAG CTCTGGATGG TCTGGAATGA CGAAGGAGCC AGAGAGCAGT TGAGGGTTAA GGGGTTGCAG AGGGTGAAAC TGTTTGACTG GGAAAATACG GCGCGTAAGA CCCTGGAGGT TTATCAGAAG GCTGCGGGTG GTGCCAGGTG A
|
Protein sequence | MHIGISALNF SPGEMGGQET YFRNLVHHLQ RVDRENSYAL LCDARKVREF PLSNDSFRVT LCNYDKPSLN WLIRGMLKKM IHLDLVNLRL KGLKLDVIHH PFTVLNPQWS QIPSVLTFLD MQQEYFPQFF SKLELAIRKQ ISRPSAEKAT RIIAISRHVK DCLVEKYGID AGKIDVIYPG CGAEFRVIDD AVGLAELKLR YGLERPFAYY PAASWPHKNH KTLLAAWKIL QERRGFDGQL VLTGIAKQAH GEIQGEVGRL GLDATVKVLG YLPSDELPYL YNLARLMVFP SLFEGFGIPL VEAMACGCPL VCSTATSVPE VAGDAGIQFD PLSPEDMADK LWMVWNDEGA REQLRVKGLQ RVKLFDWENT ARKTLEVYQK AAGGAR
|
| |
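The relationship between the Gene sequence and Protein sequence fields can be illustrated with a small translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (the leading ATG and trailing TGA suggest this, even though the gene lies on the minus strand of the replicon) and that translation table 11 assigns the same amino acids to codons as the standard code. The `translate` helper is hypothetical, and only the first and last blocks of the sequence are used here.

```python
# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the Gene sequence field above.
first_blocks = "ATGCATATAG GGATCAGCGC CCTGAATTTT"
last_blocks = "GCTGCGGGTG GTGCCAGGTG A"

print(translate(first_blocks))
print(translate(last_blocks))
```

The two fragments translate to the first ten and last six residues of the Protein sequence field. The last 21 bases can be translated on their own because the preceding 1140 bases are a whole number of codons.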