Gene Gura_3804 details


Gene Information

Locus tag: Gura_3804
Symbol:
ID: 5166593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4446679
End bp: 4448427
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 49%
IMG OID: 640551287
Product: glycosyl transferase family protein
Protein accession: YP_001232528
Protein GI: 148265822
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
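
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4446679
end_bp = 4448427

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1749 bp) and Protein Length (582 aa).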


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAATA TTCGATATAG CATCTCGATT GTCGTTCTCA ATAATCTGGA TATTACCAAG 
AAGTGCCTTC AATTGATCGA AGCCAATTCA GGTCCTGATT ATGAACTCAT CATCAGCGAT
AACGGTTCAG TCGATGGTAC TGCCGAATTT CTGCGCGACT ACTCCCTTTG CCATGCAAAC
TGTAAAATCA TTTCCTATGC GTACAACACG GGGTTTGGAC ACCCACACAA TCAGGCCCTG
CTTGAGGCGC GTGGTGAATA TTTTGTTGTC CTCAATAACG ATATCTTCAT CCATGAGTCA
GGCTGGCTGG AGAAGCTTTC CAGGTGTTTT GACGAAAATG CCAGGTTGGC AATTGTAGGT
TTTACAGGTG GTTATTCTGC ACTGGATGAT ATCGGTCATG GTGTGAAGGA ACCAGCAAGA
GGGGAGTATA TACAGGCTTC TTGCCTGATG ATCCCGAGGA GAATGGCCGA TAGTTTTGGA
CTTTTTGCCG AAGAGTATGA GTTGGCATAC TGGGAAGATG TAGACTTATC CTTGAGATAC
CGTCAGATGG GGTTCGATAT CCAGCTTGTC AGTTGCTTAC ACGAACATAT TGATAATGCC
ACCTCTGCCA CTATCGATAA AGTGTTGTTG TCGAGGGTTC GCGCCCGCAA TCACAAAACG
TTTCAAAAGA GATGGTCCGG TTACCTGGAG AAACGCTCTT TCACCGGCCG CGTGTTGATT
TGTGCCGTTA CCGACAGTCT TGACAGCCTG CTGGCCGTTA CCACGGTGCT GCAGAGGCTC
AAGGACGAGC AATTATTGGT AAACTTCGAC CTGATCACCA ATCATCCTGA GGTGTTCTTA
CTTCATCCCG CTGTTGACGC CTGTTTTCTG CCGAACGAGG TACCAGATCA TGAAGGCTAT
GACAGGATCA TCGACCTTGA TTTCAGTAGT ATCCCCCCCG GACAACCGGT GGTCCTTGCT
TTGGCAGAGC AGGCTGTCGT AACCATTACC CGGCTTTTTC CTGTACTTCA TCTGGATTCG
TTGAAACAGA CCATTGCAAC TCCCTTCTTG GTTCCAGGCA AGGAATATGC TGTCCTGACG
GTTTCGTGCG AGGACGAGCA TTTTGAGCAG ATCAAGGCCC TTGTTGCTGA AGCAGGGTAT
GAGCCGGTTA TAGTGGAGGT TCTTGACGAT GGGGCGGCTT ACAGTATCGA CAACGGTGAG
CGAGTCGACT TTCGTGCAAT GGCGGCAGCG GTTTCTACGA GCAGATTGTT AATCGGCCCT
AGCGGAGTAC CTCTGAAGGT TGCCCAGGCG TTAGGTGTGC CGGTGATAGC CATATTCCGG
CAAGGGGAGG ATCCGCTTGC TCTAGGGATT GATTTCGCGC GTGCTAGCTG GATTTTTGCG
CGCAGTGATC TGCCTCGGCA ATTGGAAGAT GTTCTTAATG AAGAGGGAGA TAGGGCTGAA
GCGGAATTAG CCTATTTGCA GAAATCGATA CTGCAACGCA TGGCCGACTA TAATGCCTTA
TGGCTTGGAT ACAGGGGGCA GACTGAGCGG ATCGAGCAAA TTGCACAGGA GTATGCTGAA
CTGGAGAATC AGTTTCGATG CAAATCGCTA GAATATGACG AAATCATTGC AAGGCAAGAA
CAAGTAATTG CAGAGTACGA TCATAAAATT AGTGCCTTGG AGAATTCTTT GATCTGGAGA
GTAACCAGCC CGATCAGGCG GATGATTGAT TTTATGCTGA ATCAGTTTCG GGGGAATTCA
TGCGGCTGA
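
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how that might be done and how a GC fraction could then be computed for comparison with the GC content listed in the record. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first three blocks of the gene sequence shown above.
first_blocks = "ATGGGGAATA TTCGATATAG CATCTCGATT"
joined = clean_sequence(first_blocks)
print(len(joined), round(gc_fraction(joined), 3))
```

Passing the full gene sequence to `clean_sequence` should give a string whose length can be compared with the Gene Length field.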
 
Protein sequence
MGNIRYSISI VVLNNLDITK KCLQLIEANS GPDYELIISD NGSVDGTAEF LRDYSLCHAN 
CKIISYAYNT GFGHPHNQAL LEARGEYFVV LNNDIFIHES GWLEKLSRCF DENARLAIVG
FTGGYSALDD IGHGVKEPAR GEYIQASCLM IPRRMADSFG LFAEEYELAY WEDVDLSLRY
RQMGFDIQLV SCLHEHIDNA TSATIDKVLL SRVRARNHKT FQKRWSGYLE KRSFTGRVLI
CAVTDSLDSL LAVTTVLQRL KDEQLLVNFD LITNHPEVFL LHPAVDACFL PNEVPDHEGY
DRIIDLDFSS IPPGQPVVLA LAEQAVVTIT RLFPVLHLDS LKQTIATPFL VPGKEYAVLT
VSCEDEHFEQ IKALVAEAGY EPVIVEVLDD GAAYSIDNGE RVDFRAMAAA VSTSRLLIGP
SGVPLKVAQA LGVPVIAIFR QGEDPLALGI DFARASWIFA RSDLPRQLED VLNEEGDRAE
AELAYLQKSI LQRMADYNAL WLGYRGQTER IEQIAQEYAE LENQFRCKSL EYDEIIARQE
QVIAEYDHKI SALENSLIWR VTSPIRRMID FMLNQFRGNS CG
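
The protein sequence above can be related to the gene sequence by codon translation. The sketch below builds the codon-to-amino-acid mapping by hand; translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so the standard mapping is used here. The function name `translate` is a hypothetical helper, and alternative start codons are not handled (this gene starts with ATG, so that case does not arise).

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments,
# which translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Usage: the first three blocks of the gene sequence give the first
# ten residues of the protein sequence.
print(translate("ATGGGGAATA TTCGATATAG CATCTCGATT"))
```

The last nine bases of the gene sequence (TGCGGCTGA) translate to the final two residues followed by a stop codon, which is why the protein is one residue shorter than the codon count.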