Gene Gura_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2004 
Symbol 
ID5166982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2321079 
End bp2322305 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content60% 
IMG OID640549498 
Producthypothetical protein 
Protein accessionYP_001230767 
Protein GI148264061 
COG category 
COG ID 
TIGRFAM ID[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAC TCAAGCTATC TACGTTTCTC GTTGCCGCTG TTCTTGCAGC CGGAGTCCCC 
GCCGCTCAGG CGACGCCATT CCATGCCGGC GGCACCGGCA ACTGCGATGG ATGTCACGGT
CCGAACTTTA CCGCAGGAAG CTACTCAACA TTGCGCGGCG TTGATCCTGG CTCGACCTGC
CTTCGTTGCC ATGGCGCAGC CAGGCCGACT GAGCATCAAA TAGCCACTCA TCCCGTTCCG
CCCAAGGGAA TTCCTCCCGT GTCCCTTACC CCCGGCGGCG ATTTTGCTTA TCTGCGGAAA
AACTATTTCT GGGGCGATTC CAACGGCAAG CGTGGCATAA GTCCCGGAGA AAGACACGGA
CACAACATCG TTGCAGCCGC TTACGGCTAT TCCCGGGACA CGGCCCTCCT CGGTTCCCCG
GGCGGCTCAT ATCCGGCTGA TGCCCTATCC TGCATCAGCT GCCATGATCC CCACGGGAAC
TACCGGGTGC TGGACAGATA CGGCACTGTT TCGAGCGAGG GGAACCCCAT CGGCGAGGCG
GGTTCTTACG GGGCAGCAGC AACAAGCGCC AGTTCCGTCG GCACTTACCG CTTGCTGGCC
GGCAAGGGAT ATCAAACAAA ATCTGCCGGG CACGTTTTTT CCTACGATCC GCCGATGGCC
GTTTCCCCCA CCAGCTACAA CAGATCCGAG GCAAGTTCTG ATACGAGGGT TGCCTACGGC
AAAGGGGTGT CGAAATGGTG CGCAAACTGC CACGAAGGTT TCCTGTCTGG GACGAGCCAT
ATCCACCCGG CTGACGTGGA GCTTGGTGTG GCAATAGCCG CGACCTACAA CAATTACGTG
AAGTCCGGCG ATCTTACCGG CATCCGGGCC ACCGCCTATA CCTCGCTGGT CCCGTTCCAG
AGTGGCGAAG TGACCGACCC GCAGCAACTG TCTGCCGAGT TGACTTCTAC CGCCGGTCCC
AACCCGGACG ACAGGATAAC CTGCCTTACC TGCCATCGGG CCCATGCCTC CGGCTGGGAC
AGCATAGGCA GGTGGAACAT GAAGGGGGAC TTCCTGACCG TGGCCGGCGC GTATCCGGGA
ATCGACACCA ATGGTACGGG GAATTACGGC GAGAACTCCA CCGGCAAACT CAGAACCGAG
TATCAGGCGG CCATGTACGG CCGGGACGCA TCCGGATTCG CAACTTTTCA ACGGCAGCTT
TGCGATAAGT GCCATGCGAA GGATTGA
 
Protein sequence
MNRLKLSTFL VAAVLAAGVP AAQATPFHAG GTGNCDGCHG PNFTAGSYST LRGVDPGSTC 
LRCHGAARPT EHQIATHPVP PKGIPPVSLT PGGDFAYLRK NYFWGDSNGK RGISPGERHG
HNIVAAAYGY SRDTALLGSP GGSYPADALS CISCHDPHGN YRVLDRYGTV SSEGNPIGEA
GSYGAAATSA SSVGTYRLLA GKGYQTKSAG HVFSYDPPMA VSPTSYNRSE ASSDTRVAYG
KGVSKWCANC HEGFLSGTSH IHPADVELGV AIAATYNNYV KSGDLTGIRA TAYTSLVPFQ
SGEVTDPQQL SAELTSTAGP NPDDRITCLT CHRAHASGWD SIGRWNMKGD FLTVAGAYPG
IDTNGTGNYG ENSTGKLRTE YQAAMYGRDA SGFATFQRQL CDKCHAKD