Gene Gura_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3852 
Symbol 
ID5167007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4501462 
End bp4502769 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content63% 
IMG OID640551334 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001232575 
Protein GI148265869 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAGA CTCAACTTGA CTACGCCCGC CAGGGCACGA TCACCAAAGA AATGAAGGAA 
GCAGCCCTCG CCGAAGGGGT AAGCCCGGAG TTCATCCGCG ACGGACTGGT TGCCGGCAAC
ATCATCATCT GCCATAACAT CAAGCACGCA GGCGGTCGAC CGCTGGCGGT AGGCCGCGGA
CTGCGCACCA AGGTCAACGC CAACATCGGC ACCTCGGCCG ACGACCTGGA CATAGCCAAG
GAGCTGGAAA AGGCCCGCGT AGCGGTAAAA CACGGCGCAG ACGCCATCAT GGACCTCTCC
ACCGGCGGAC CGGTTGATGA GATCCGCCGC GCCATCATTG CCGAAACCAG TGCCTGCATC
GGCAGCGTAC CCCTCTATCA GGCGGCCCTC GATGCGGTAC GGACAAAGAA GAAGGCGATC
GTCGACATGA CCGTGGACGA CATTTTCGCC GGGATAATCA AGCATGCCGA AGACGGAGTG
GATTTCATCA CCGTCCACTG CGGCGTGACC TGCGCAACGG TGGAGCGGAT GAAAAACGAG
GGTCGGATCA TGGACGTGGT CTCCCGCGGC GGGGCGTTCA CCATCGAGTG GATGGCCCAC
AACAACAAGG AAAACCCGCT CTTCGAGCAC TTCGACCGGC TCCTGGAAAT CACCAAAGAG
TATGACATGA CCCTCTCCCT GGGTGACGGC TTCCGCCCCG GCTGCCTCGC CGACGCCACC
GACCGGGCGC AGATCCACGA ACTGATCCTT CTGGGCGAGC TGACCCAGCG CGCCCAGGCA
TTCGGCGTCC AGGTCATGAT TGAAGGTCCG GGGCACATGC CGCTCAACCA GATCGAGGCC
AACATCCTCC TGCAGAAGAG GCTCTGTCAC GGCGCCCCAT TCTATGTGCT CGGCCCGCTG
GTCACCGACA TCGCCCCGGG CTACGACCAT ATCACCTGCG CCATCGGCGG CACCATCGCC
GCCGCCGCCG GGGCCGACTT CCTCTGCTAT GTCACCCCCA GCGAACACCT GCGCCTCCCG
ACCGTGGACG ACGTGAGAGA AGGGGTCATC GCCTCCCGCA TCGCCGCCCA CGCTGCCGAC
ATCGTCAAGG GGGTGAAGGG GGCGATGGAC AAGGACATCC AGATGGCCAA GTGCCGGAAA
AAGCTCGACT GGGAAGGGCA GTTCGCCCTG GCCCTCGACC CGGAAAAGGC CCGGCGGCTG
CGCGCCGAAT CAGGGGTTGC CGACCACGGC GCCTGCACCA TGTGCGGCGA GTTCTGCGCC
TACAAGGTGA TGGACGACGC CATGGAAAAG CAGGCGGTCG AATCGTAA
 
Protein sequence
MTKTQLDYAR QGTITKEMKE AALAEGVSPE FIRDGLVAGN IIICHNIKHA GGRPLAVGRG 
LRTKVNANIG TSADDLDIAK ELEKARVAVK HGADAIMDLS TGGPVDEIRR AIIAETSACI
GSVPLYQAAL DAVRTKKKAI VDMTVDDIFA GIIKHAEDGV DFITVHCGVT CATVERMKNE
GRIMDVVSRG GAFTIEWMAH NNKENPLFEH FDRLLEITKE YDMTLSLGDG FRPGCLADAT
DRAQIHELIL LGELTQRAQA FGVQVMIEGP GHMPLNQIEA NILLQKRLCH GAPFYVLGPL
VTDIAPGYDH ITCAIGGTIA AAAGADFLCY VTPSEHLRLP TVDDVREGVI ASRIAAHAAD
IVKGVKGAMD KDIQMAKCRK KLDWEGQFAL ALDPEKARRL RAESGVADHG ACTMCGEFCA
YKVMDDAMEK QAVES