Gene Gura_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1920 
Symbol 
ID5165515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2224614 
End bp2225645 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content41% 
IMG OID640549414 
ProductNHL repeat-containing protein 
Protein accessionYP_001230683 
Protein GI148263977 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00542747 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTACTCCA GACTGTTTCT TGTATTAGTG TTTTTTACCA TTTTCGGCTG TACCACTGTT 
GAACCACTCG TACTGCGAGA TACTAAGATT GATCTTGCAT GGCCACTCCC TCCAAACTCC
CCCCGGATAA GATTCCTCCG CACGATAAAT GGTCCAGATA ATATTATTAC CGCTCCTGGA
AAGGTGCAGC ATTTATTCGA GATGGTAACA GGTGAAAGTA GACTTAAGGT TGATTTTGAT
GCGCCCTATG GCATCACCGG AGATGGAGAA TCTGTTCTAT ATATAGCAGA TACAGGTGTC
GGTCTTGTTC ATAGGTACGA TTTAATCAAC AGAGAGGTTG GTTATATTGT TCAAGCAGGA
GATGAAGAAA TGTCCAGCCC CGTTGGAGTG GCTGTTGATG GTGAAAAAAA TCTTTATGTT
GCTGATTCTG TGAATGCTAA AGTCTACAAA TATAATAAGA AGGGACAGTT TCTTAGGGAA
TTAAAATATG AAGCAGGATT TAAAAGGCCT GCCGGTATAG CGGTGAATAG CCGAAATGAA
AAATTTATTG TGGATGTGCT GGCACATAAA TTGTATATTT TTGGTGAGGA TGATCGATTT
ATACGTGACT TTCCCAAAAT GAAGAAGGGC GAAGAGCTTA ATTATCCGTC TAATGTTGCT
ATCGACCGTG CAGATAATGT TTATGTCACC GATTCGATGA ATTTTACCAT TAAGGTGTAC
AACCGTGAAG GGGATCTGCA AAGGACTATC GGTCAAATTG GCGATTCACC CGGTTCTTTC
GCGAGACCTA AAGGCATTGC GGTAGACAGC GATCAACAAA TATATGTGGT TGATGCAACC
CTTGACAATT TTCAGATATT CAATCAAAAA GGAAATCTTC TGCAACTCAT AGGCAAGAAC
GGTGGAGGTG CTGGCGAATT TTATCTGCCG AGCGGCATAT ATATTGACAA GCATGATCGT
ATATTTGTTA CCGACACCTA TAATCGGAGA ATTCAGGTAT TCCAATACCT GAAAGAAGGT
GGGAAACTGT GA
 
Protein sequence
MYSRLFLVLV FFTIFGCTTV EPLVLRDTKI DLAWPLPPNS PRIRFLRTIN GPDNIITAPG 
KVQHLFEMVT GESRLKVDFD APYGITGDGE SVLYIADTGV GLVHRYDLIN REVGYIVQAG
DEEMSSPVGV AVDGEKNLYV ADSVNAKVYK YNKKGQFLRE LKYEAGFKRP AGIAVNSRNE
KFIVDVLAHK LYIFGEDDRF IRDFPKMKKG EELNYPSNVA IDRADNVYVT DSMNFTIKVY
NREGDLQRTI GQIGDSPGSF ARPKGIAVDS DQQIYVVDAT LDNFQIFNQK GNLLQLIGKN
GGGAGEFYLP SGIYIDKHDR IFVTDTYNRR IQVFQYLKEG GKL