Gene Gura_3799 details

Gene Information

Locus tag: Gura_3799
Symbol:
ID: 5166158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4438486
End bp: 4439646
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 55%
IMG OID: 640551282
Product: glycosyl transferase, group 1
Protein accession: YP_001232523
Protein GI: 148265817
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
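
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(4438486, 4439646)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```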


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATATAG GGATCAGCGC CCTGAATTTT TCTCCCGGCG AGATGGGGGG GCAGGAGACA 
TATTTCAGGA ATCTCGTTCA CCATCTGCAG CGGGTGGACA GGGAAAACAG CTATGCCCTG
TTGTGCGATG CACGCAAGGT CCGCGAGTTC CCCCTCTCCA ACGACTCGTT CAGGGTTACG
CTCTGCAATT ACGACAAACC CTCGCTCAAC TGGCTCATCC GTGGCATGCT GAAAAAGATG
ATCCATCTGG ATCTGGTAAA CCTGCGACTG AAAGGGCTAA AGCTCGATGT CATTCATCAT
CCGTTCACTG TCCTGAACCC CCAGTGGTCC CAGATACCCT CAGTGTTGAC CTTTCTTGAC
ATGCAGCAGG AATATTTCCC GCAGTTTTTC TCAAAACTCG AATTGGCCAT CCGTAAACAG
ATATCCCGCC CTTCTGCTGA AAAGGCGACC AGGATCATCG CCATTTCCCG GCATGTTAAG
GACTGTCTTG TGGAAAAATA CGGGATTGAT GCAGGGAAAA TTGACGTGAT TTATCCCGGT
TGCGGCGCCG AATTCCGGGT AATTGACGAT GCCGTGGGGC TGGCGGAGCT GAAGCTCCGC
TACGGCCTGG AAAGGCCGTT TGCCTATTAT CCGGCGGCGA GCTGGCCCCA TAAGAATCAC
AAGACACTCC TGGCGGCCTG GAAGATTTTG CAGGAGAGGC GCGGCTTTGA CGGCCAGCTC
GTCCTTACCG GCATTGCCAA ACAGGCGCAC GGCGAAATCC AGGGGGAGGT CGGCAGGCTC
GGTCTTGATG CTACGGTGAA GGTCTTGGGC TATCTACCTT CCGATGAACT CCCGTACCTT
TACAACCTTG CCCGGCTGAT GGTTTTCCCT TCGCTCTTCG AGGGGTTCGG CATTCCGCTG
GTGGAGGCCA TGGCCTGCGG TTGTCCGCTA GTATGCTCCA CGGCGACATC CGTTCCCGAG
GTGGCTGGTG ATGCCGGGAT TCAGTTCGAT CCCCTTTCTC CGGAGGATAT GGCCGACAAG
CTCTGGATGG TCTGGAATGA CGAAGGAGCC AGAGAGCAGT TGAGGGTTAA GGGGTTGCAG
AGGGTGAAAC TGTTTGACTG GGAAAATACG GCGCGTAAGA CCCTGGAGGT TTATCAGAAG
GCTGCGGGTG GTGCCAGGTG A
 
Protein sequence
MHIGISALNF SPGEMGGQET YFRNLVHHLQ RVDRENSYAL LCDARKVREF PLSNDSFRVT 
LCNYDKPSLN WLIRGMLKKM IHLDLVNLRL KGLKLDVIHH PFTVLNPQWS QIPSVLTFLD
MQQEYFPQFF SKLELAIRKQ ISRPSAEKAT RIIAISRHVK DCLVEKYGID AGKIDVIYPG
CGAEFRVIDD AVGLAELKLR YGLERPFAYY PAASWPHKNH KTLLAAWKIL QERRGFDGQL
VLTGIAKQAH GEIQGEVGRL GLDATVKVLG YLPSDELPYL YNLARLMVFP SLFEGFGIPL
VEAMACGCPL VCSTATSVPE VAGDAGIQFD PLSPEDMADK LWMVWNDEGA REQLRVKGLQ
RVKLFDWENT ARKTLEVYQK AAGGAR
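
The gene and protein sequences can be related to each other, and to the GC content field, with a few lines of code. The following is a rough Python sketch, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids as the standard genetic code (it differs in the permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order ('*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGCATATAG GGATCAGCGC"))  # MHIGIS
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and GC content fields of the record.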