Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3804 |
Symbol | |
ID | 5166593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4446679 |
End bp | 4448427 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640551287 |
Product | glycosyl transferase family protein |
Protein accession | YP_001232528 |
Protein GI | 148265822 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAATA TTCGATATAG CATCTCGATT GTCGTTCTCA ATAATCTGGA TATTACCAAG AAGTGCCTTC AATTGATCGA AGCCAATTCA GGTCCTGATT ATGAACTCAT CATCAGCGAT AACGGTTCAG TCGATGGTAC TGCCGAATTT CTGCGCGACT ACTCCCTTTG CCATGCAAAC TGTAAAATCA TTTCCTATGC GTACAACACG GGGTTTGGAC ACCCACACAA TCAGGCCCTG CTTGAGGCGC GTGGTGAATA TTTTGTTGTC CTCAATAACG ATATCTTCAT CCATGAGTCA GGCTGGCTGG AGAAGCTTTC CAGGTGTTTT GACGAAAATG CCAGGTTGGC AATTGTAGGT TTTACAGGTG GTTATTCTGC ACTGGATGAT ATCGGTCATG GTGTGAAGGA ACCAGCAAGA GGGGAGTATA TACAGGCTTC TTGCCTGATG ATCCCGAGGA GAATGGCCGA TAGTTTTGGA CTTTTTGCCG AAGAGTATGA GTTGGCATAC TGGGAAGATG TAGACTTATC CTTGAGATAC CGTCAGATGG GGTTCGATAT CCAGCTTGTC AGTTGCTTAC ACGAACATAT TGATAATGCC ACCTCTGCCA CTATCGATAA AGTGTTGTTG TCGAGGGTTC GCGCCCGCAA TCACAAAACG TTTCAAAAGA GATGGTCCGG TTACCTGGAG AAACGCTCTT TCACCGGCCG CGTGTTGATT TGTGCCGTTA CCGACAGTCT TGACAGCCTG CTGGCCGTTA CCACGGTGCT GCAGAGGCTC AAGGACGAGC AATTATTGGT AAACTTCGAC CTGATCACCA ATCATCCTGA GGTGTTCTTA CTTCATCCCG CTGTTGACGC CTGTTTTCTG CCGAACGAGG TACCAGATCA TGAAGGCTAT GACAGGATCA TCGACCTTGA TTTCAGTAGT ATCCCCCCCG GACAACCGGT GGTCCTTGCT TTGGCAGAGC AGGCTGTCGT AACCATTACC CGGCTTTTTC CTGTACTTCA TCTGGATTCG TTGAAACAGA CCATTGCAAC TCCCTTCTTG GTTCCAGGCA AGGAATATGC TGTCCTGACG GTTTCGTGCG AGGACGAGCA TTTTGAGCAG ATCAAGGCCC TTGTTGCTGA AGCAGGGTAT GAGCCGGTTA TAGTGGAGGT TCTTGACGAT GGGGCGGCTT ACAGTATCGA CAACGGTGAG CGAGTCGACT TTCGTGCAAT GGCGGCAGCG GTTTCTACGA GCAGATTGTT AATCGGCCCT AGCGGAGTAC CTCTGAAGGT TGCCCAGGCG TTAGGTGTGC CGGTGATAGC CATATTCCGG CAAGGGGAGG ATCCGCTTGC TCTAGGGATT GATTTCGCGC GTGCTAGCTG GATTTTTGCG CGCAGTGATC TGCCTCGGCA ATTGGAAGAT GTTCTTAATG AAGAGGGAGA TAGGGCTGAA GCGGAATTAG CCTATTTGCA GAAATCGATA CTGCAACGCA TGGCCGACTA TAATGCCTTA TGGCTTGGAT ACAGGGGGCA GACTGAGCGG ATCGAGCAAA TTGCACAGGA GTATGCTGAA CTGGAGAATC AGTTTCGATG CAAATCGCTA GAATATGACG AAATCATTGC AAGGCAAGAA CAAGTAATTG CAGAGTACGA TCATAAAATT AGTGCCTTGG AGAATTCTTT GATCTGGAGA GTAACCAGCC CGATCAGGCG GATGATTGAT TTTATGCTGA ATCAGTTTCG GGGGAATTCA TGCGGCTGA
|
Protein sequence | MGNIRYSISI VVLNNLDITK KCLQLIEANS GPDYELIISD NGSVDGTAEF LRDYSLCHAN CKIISYAYNT GFGHPHNQAL LEARGEYFVV LNNDIFIHES GWLEKLSRCF DENARLAIVG FTGGYSALDD IGHGVKEPAR GEYIQASCLM IPRRMADSFG LFAEEYELAY WEDVDLSLRY RQMGFDIQLV SCLHEHIDNA TSATIDKVLL SRVRARNHKT FQKRWSGYLE KRSFTGRVLI CAVTDSLDSL LAVTTVLQRL KDEQLLVNFD LITNHPEVFL LHPAVDACFL PNEVPDHEGY DRIIDLDFSS IPPGQPVVLA LAEQAVVTIT RLFPVLHLDS LKQTIATPFL VPGKEYAVLT VSCEDEHFEQ IKALVAEAGY EPVIVEVLDD GAAYSIDNGE RVDFRAMAAA VSTSRLLIGP SGVPLKVAQA LGVPVIAIFR QGEDPLALGI DFARASWIFA RSDLPRQLED VLNEEGDRAE AELAYLQKSI LQRMADYNAL WLGYRGQTER IEQIAQEYAE LENQFRCKSL EYDEIIARQE QVIAEYDHKI SALENSLIWR VTSPIRRMID FMLNQFRGNS CG
|
| |