Gene Gura_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4041 
Symbol 
ID5165928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4698342 
End bp4700081 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content54% 
IMG OID640551520 
Producthypothetical protein 
Protein accessionYP_001232758 
Protein GI148266052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAG GGAACAATGT GCCTACGATA GAAGCGGATG ATTTTGCTCG ACGTTTTTCT 
TTACGGGCTG GGAACCTGAT GTGGCTACTC GGTGCCGGTG CATCGGCCTC AGCTGGGATA
CCCACTGCCG GAGATATGGT CTGGGAGTTT AAGCAGCAAC TATTCATCAG CCAACGACGA
GTTTCGCACC AGTCTATGGC CGATCTGTCG AATCCCACAA TTCGTGCTCA GTTACAGGCC
TACGTTGATT CCTCAGGGAG TTTGCCTTCT CCCGGCTCCC CGGACGAGTA CGCAGCACTT
TTTGAAGCAG TTTATCCTGC AGAGTCGGAT CGTCGCGCCT ATTTGGATGC CAAAATGGGT
GGAGCAAAGT TGTCCTATGG ACACTTGGCC CTCGCTTCTC TGATGCATGC ACAACTTACG
CGTTTGGTGT GGACAACCAA CTTCGATCCA CTCGTGGCGG ATGCTTGTGC GAAGGTATAT
GACGGAACAG GACCACTCAC GACTGTTGCT CTCGAAGCGC CTGATTTAGC TGCTCAGTGC
ATAGGAGGAG GAAGATGGCC AATCGAGGTA AAGTTGCATG GAGACTTCAG ATCTCGGCGA
CTCAAAAACA CTGGTGATGA ATTGCGTTAC CAAGATCAAC GTTTAAGGCA ACTTCTAGTA
GATTCCTGCA AACGCTTTGG ACTTGTTGTA GTCGGGTACA GTGGCCGTGA CGATTCCATT
ATGGATGCAC TGGAGGAGGT GCTGGAACAA AATGGCGCTT ACCCTTCCGG ATTGTTCTGG
CTGCATCGCG GGGAAGATCC ACCGCTCGCC CGAGTTGAAC AATTGCTGGC GCGAGCGAAA
CAGGCTGGAG TGGAGTCAGC ACTGATAAGG GTTCAAAACT TCGATGAGGC AATGCGAGAC
TTGGTGCGAA TGGTAAAAAG CATCGACACC ACGATACTCG ACACCTTCGC AGCCGAGCGC
CGCCGCTGGA GTAGCGCCCC GCCGCCAGGA GGAAAACGAG GCTGGCCGGT GGTGCGCCTC
AATGCAATAC CTGTCGTACA AATTCCAACA GTATGTCGAC GCGTTGTCTG TGAGATCGGC
GGCCACGCAG AAGCACGGGA AGCTGTCAAG CAGGCTGGCG TTGACGTCCT CGTTGCTCGC
ACTCGGGCTG GCGTGTTGGC CTTTGGAGCC GATGGCGACG TGCGTGCAGC TTTTGGTGGT
TACAACATTA CTGATTTTGA CCTACATACC ATCGACAACA AGCGACTGCG CTACGATTCC
GGCGAGCGTG GCCTGTTACG CAGTGCACTT ACCCGTGCTT TAGAACGCCA TCATCGATTG
GACGCAACTC GTCGGAGAAG TGCTGATTTG TTAGCACCAT CGGACCCAAG AGAGAGTGTC
TGGGCACCTC TAAAGCAGCT TGTAGGATCA CTTAACGGTA CGGTCAGCGG TTTCCCCGGT
TTGCATTGGC GCGAAGGAAT CGGTACTCGG CTTGACTGGG CTGATGAACG CTTGTGGCTT
CTGATAGAAC CCCGCACGGT CTTCGACGGC ATTAACGACG AGAACAAGGC GGCTGCCGCC
GATTTCGCCC GTGAGCGAAC TGTCAAGCGT TATAATAAGC AACTCAACGA TCTAATCGTA
TTTTGGGCTG ATCTGCTCTC CGGCGGAGGA GACCTGCGTG CGTTAGATAT CGGAGGTGGG
GTCGATGCCG TCTTTAGCCT TTCCAATATT ACAGGTTTTT CAAGGAGGGC TGGGGTATGA
 
Protein sequence
MGKGNNVPTI EADDFARRFS LRAGNLMWLL GAGASASAGI PTAGDMVWEF KQQLFISQRR 
VSHQSMADLS NPTIRAQLQA YVDSSGSLPS PGSPDEYAAL FEAVYPAESD RRAYLDAKMG
GAKLSYGHLA LASLMHAQLT RLVWTTNFDP LVADACAKVY DGTGPLTTVA LEAPDLAAQC
IGGGRWPIEV KLHGDFRSRR LKNTGDELRY QDQRLRQLLV DSCKRFGLVV VGYSGRDDSI
MDALEEVLEQ NGAYPSGLFW LHRGEDPPLA RVEQLLARAK QAGVESALIR VQNFDEAMRD
LVRMVKSIDT TILDTFAAER RRWSSAPPPG GKRGWPVVRL NAIPVVQIPT VCRRVVCEIG
GHAEAREAVK QAGVDVLVAR TRAGVLAFGA DGDVRAAFGG YNITDFDLHT IDNKRLRYDS
GERGLLRSAL TRALERHHRL DATRRRSADL LAPSDPRESV WAPLKQLVGS LNGTVSGFPG
LHWREGIGTR LDWADERLWL LIEPRTVFDG INDENKAAAA DFARERTVKR YNKQLNDLIV
FWADLLSGGG DLRALDIGGG VDAVFSLSNI TGFSRRAGV