Gene Gura_2119 details

Gene Information

Locus tag: Gura_2119
Symbol:
ID: 5166834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 2476823
End bp: 2478031
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 51%
IMG OID: 640549615
Product: hypothetical protein
Protein accession: YP_001230880
Protein GI: 148264174
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
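
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions and the final codon is taken to be a stop codon. Both are assumptions, not statements made by the record itself. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information fields above.
start_bp = 2476823
end_bp = 2478031

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```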


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACAA GTGCAGGTGT GGGTCATAGC TTTCACAGGG ATCCGGCAGT AGCAGGTAAA 
GAGGCCGCCC GGAAAGCCCT GGAGCAGGCA AAGATCGACA AACCCGATTT TGTCTTCGTG
TTTGCTACCG TCGGGTATAA CCAGAAGGTC CTGATAAAAG CGATAAGAGA AGCAACTTCA
CGGGCTCCTT TAAGCGGTTG CTCGGGCGAA GGAATAATAA CGCAGGAAAC CGTTATCGAA
ACCAACTTTG GCGTTTCAGT CATGGCGATC AGCTCGGACG AGCTTCGGTT CAAAAACGCC
CGCGTAAAAG AAATAGCCGG GCAATCCTAT AAAGCCGGAG AGCGCCTGGC AGAAGAGGTG
AATCCATTGT TGACTTCCGA CGACATCGCC TGCTTTCTTT TTGCCGACGG GCTGGTTTTT
GATTTCGATC CTTTTTGGGC GGCGTTCGAT AAATCACTGC GCAGCAAAAG CCCGCTCCCT
CTTTTCGGCG GATTGGCTGC GGACAACTGG ACATCCCAGA AAACCTACCA GTACCACGAC
GATGATGTTT TTTCCGAGGG CATTTCGTGT GTTGTCATGT CGGGCAAAGG GGATGTTGCC
TGGGGAATAA ACCACGGCTG TGTCCCGATC GGAACCAAAC GCACGATCAC ACGAAGCAAG
GGAAACATCA TCTACGAGAT CGACGGGGTT CCGGCCCTCG AAGCCCTTAA GGACTACTTT
GAAGAAGACT GGATTGACCA CTGGAACAAA ACGACTTTGA ATATATGTCT TGGATTCAAA
ACCCCTGAGC ATCTAAGGAA GGGTTATGAA GAATACATCA TCAGGTACAT AGTGGGAAAG
GACGATCGGG AAGGATCTGT AACAATACAG TCCGATGTCC GGGAAGGCAC CGACCTCTGG
CTGGTTCGCC GCGACAAGGA ATTGATAACG AGCGGCATGA AGGCAATCTC CCGACAGATA
AAGGACAAGA CAGGAACCCG GAAGCCGAAA TTCGTTCTGC AATTCGAATG CGTCGGCCGG
GGAAAGGTCG TTTTTCGCGA GGGGGAGAAG ATCGAACTGA TCAGGTCTCT TCAGAAAGAC
ATCGGAGGAG ATATCCCGTG GATGGGTTTT TACACTTATG GCGAAATCGG CCCGATCACG
AAGTATAATT GTTTTCACAA CTTCACGTCC GTAATCAGCG CAGTATGCTG TAAAGAGCGG
GAATCATGA
 
Protein sequence
MGTSAGVGHS FHRDPAVAGK EAARKALEQA KIDKPDFVFV FATVGYNQKV LIKAIREATS 
RAPLSGCSGE GIITQETVIE TNFGVSVMAI SSDELRFKNA RVKEIAGQSY KAGERLAEEV
NPLLTSDDIA CFLFADGLVF DFDPFWAAFD KSLRSKSPLP LFGGLAADNW TSQKTYQYHD
DDVFSEGISC VVMSGKGDVA WGINHGCVPI GTKRTITRSK GNIIYEIDGV PALEALKDYF
EEDWIDHWNK TTLNICLGFK TPEHLRKGYE EYIIRYIVGK DDREGSVTIQ SDVREGTDLW
LVRRDKELIT SGMKAISRQI KDKTGTRKPK FVLQFECVGR GKVVFREGEK IELIRSLQKD
IGGDIPWMGF YTYGEIGPIT KYNCFHNFTS VISAVCCKER ES
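
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts). Whitespace in the 10-base blocks is removed before processing.

```python
from itertools import product

# Codon table in the usual TCAG ordering. Translation table 11 uses the
# same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGGAACAA GTGCAGGTGT GGGTCATAGC"
print(translate(first_blocks))  # MGTSAGVGHS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) is one way to confirm that the two listings agree; `gc_fraction` can be compared with the GC content field in the same way.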