Gene Gura_2599 details

Gene Information

Locus tag: Gura_2599
Symbol:
ID: 5163979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 3009091
End bp: 3011271
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 64%
IMG OID: 640550095
Product: hypothetical protein
Protein accession: YP_001231349
Protein GI: 148264643
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID: [TIGR01451] conserved repeat domain
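
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3009091
END_BP = 3011271
GENE_LENGTH_BP = 2181
PROTEIN_LENGTH_AA = 726


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2181
print(coding_codons(GENE_LENGTH_BP))   # 726
```

Under these assumptions the record is internally consistent: 3011271 - 3009091 + 1 = 2181 bp, and 2181 / 3 = 727 codons, of which 726 encode amino acids.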


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACGCT ATCGACGGGT TATTTTCGGC ATATTCCTGA CGCTGACCGT CGCATGTCCC 
GCCATGACGG TCTGCGCCGT CGCAGCCGAC CTGAAAAAAG CGCCGCCCGG CATAGCAGAG
CGCCTGGATC AAGGCGCCAC CCAGAAACTG ATCGTCCTGT TCGATGACAG CGCCATTGAG
CGGGAAGTTG CTGCAAACCG AAGCCGGACA GGCATTGAAC ATGATGATGA CGCCATTCTG
GCATTCAGAG CTTCGCGTTA CAGGGAGCTG AAAAGCCGGG CAGAGCCTGC GGAGCTGAGC
GGCGAGGTCG AAACGGTCAA AGACTACAGT CATCTGCCGA TGTCATTCAA GCGTTTCAAA
AACCGCCGTT CGCTGGAGAA ATTCCTGGCT CTCCCTGAGG TGACGGCGGT CTACGAGAAC
CGGCCGATTT ACCCGACCCT CGCCCAGAGC CTGCCGTTAA TCAAACAGCC GGCAACAGCC
GGTCTGGGGC TGACCGGGAG CGGCGCCACA GTGGCGGTAA TCGATACGGG CATCAACTAT
ACCCTGGCCG CTTTCGGTTC CTGCACGGCG CCGGGGACCC CCGCCGGCTG CCGGATAGCC
GCCTCGGTGG ACATTACCGG CAACAACGTT ACCCTGAACA CCGACCCAAA CGGCCATGGC
ACCAACGTGG CGGGGATTGT CTCAGGAGTG GCGCCGGGCG CCCGGATTGC CGCGATCAAC
GCCTTTACCG GCGGCGCGTC TTCCATCGCC TTGATCATTG ACGGCATCAA CTGGGCCATA
GCCAACCGGA GCGCCTATAA CATCGTCGCC ATCAACATGA GTCTGGGGGA CGGCGCCAAA
TATACGGCTC CCTGCGGCAA CAGCCACACA AACCCCTTCG TCACCCCGGT GAATAACGCC
CGGGCCGTCG GCATCCTTCC GGTGGCCGCG TCGGGCAACG AGGGGTACTC GAACGGCATT
GCCAGTCCGG CCTGCACTCC GGGGGTCGCC TCTGTGGGGG CTGTCTACGA CGCCAATGTC
GGCGGGCGGC AGTGGTCAAC ATGTACCGAC AGTACCACCG CAGCCGATCA GGTCGCCTGT
TTCTCCAACA GCTCCAGCTT CGTGACCATG CTATCCCCCG GCGCGATCAT CACCGCCGCC
GGCATCGGCA TGGGTGGTAC ATCACAGGCA TCACCCCATG TGGCAGGGGC AGCCGCAGTA
TTCCGCTCGG CTTTTGCCGG CGAGACCCCG GATCAGACGC TCGCGCGCCT GACCGGCAGC
GGTGTGCCCG TCACCGACCC GCGTAACGGC GTCGTCAAGC CGCGCCTCAA CCTGCTGGCC
GCCTTGGGAG CGCCCATCAA CGACAACTTT TCCGCACGCC AGGCGCTCAG CGGCGACACC
GGCCAGCTCA CCGGCAACAA CGCCAACGCC ACCGTCGAGC CGGCCGAACC GGCCCACGCT
GGCAACAGCG GCGGCAGATC GGTCTGGTGG AGTTGGACGC CTTCCGTTTC CGGTCCGGTT
GCAATTGATA CACAAGGAAG CAGCTTCGAT ACCCTGTTGG CGGTCTACAC CGGTACGGGC
GTTACCGCCC TCACCCCCAT TGCAGCAAAC GACAACGACG GCGCCCCGGG AAACACCAGC
GGCCTCTCCT TTGTTGCCCA GGCGGGAATA GAGTACCTCA TTGCCGTGGA CGGCTTCAAC
GGAGCCTTCG GCAGCACGGT CCTCAACTGG GGGCAGGCCC CCAGCGCCGA TCTGTTCGTT
ACCATGACCG GCTCACCCGA CCCGCTGGCG CCGGGTGAGA CCCTGACCTA CTCCATCTCG
GTGGCAAACA GGGGACCGGC AACCGCCGTC AACACGACCC TGACCGATAC CCTGCCGACT
GGGGTGAGCG TCGTCTCTAC CTCTGCCGGC TGCACGACGG CAGGTGGGAT CGTCACCTGC
AACCTCGGCA GCATGGCAAG CGCCACCGCC GTCGGCCTCC AGATTGCCGT TTCGCCCGCC
TCAGCCGGGA CATTGACCAA CACCGTGAAC GTTGCTTCAG ATACCTACGA GCTCGCCCCT
GCCGACAATT CAGCCGGCAT CGCCACAACG GTTTCGCTTC CGCCGCCGGC AGTTCCGGCG
CTCTCCCCTT GGGGGATTGC GCTGGCCGCC TGCCTGCTCT CAGGCTGGCA GCAGCATCGA
AAACGGCGCA AACCCAATTA G
 
Protein sequence
MRRYRRVIFG IFLTLTVACP AMTVCAVAAD LKKAPPGIAE RLDQGATQKL IVLFDDSAIE 
REVAANRSRT GIEHDDDAIL AFRASRYREL KSRAEPAELS GEVETVKDYS HLPMSFKRFK
NRRSLEKFLA LPEVTAVYEN RPIYPTLAQS LPLIKQPATA GLGLTGSGAT VAVIDTGINY
TLAAFGSCTA PGTPAGCRIA ASVDITGNNV TLNTDPNGHG TNVAGIVSGV APGARIAAIN
AFTGGASSIA LIIDGINWAI ANRSAYNIVA INMSLGDGAK YTAPCGNSHT NPFVTPVNNA
RAVGILPVAA SGNEGYSNGI ASPACTPGVA SVGAVYDANV GGRQWSTCTD STTAADQVAC
FSNSSSFVTM LSPGAIITAA GIGMGGTSQA SPHVAGAAAV FRSAFAGETP DQTLARLTGS
GVPVTDPRNG VVKPRLNLLA ALGAPINDNF SARQALSGDT GQLTGNNANA TVEPAEPAHA
GNSGGRSVWW SWTPSVSGPV AIDTQGSSFD TLLAVYTGTG VTALTPIAAN DNDGAPGNTS
GLSFVAQAGI EYLIAVDGFN GAFGSTVLNW GQAPSADLFV TMTGSPDPLA PGETLTYSIS
VANRGPATAV NTTLTDTLPT GVSVVSTSAG CTTAGGIVTC NLGSMASATA VGLQIAVSPA
SAGTLTNTVN VASDTYELAP ADNSAGIATT VSLPPPAVPA LSPWGIALAA CLLSGWQQHR
KRRKPN
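
The gene and protein sequences above are printed in blocks of ten characters, so whitespace has to be stripped before they can be analysed. The following is a minimal sketch of how the two sequences relate: it removes the block spacing, computes GC content, and translates codons until the first stop. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the ten-character block spacing and line breaks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence, copied from the record.
first_bases = "ATGAGACGCT ATCGACGGGT TATTTTCGGC"
last_bases = "AAACGGCGCA AACCCAATTA G"

print(translate(first_bases))  # MRRYRRVIFG
print(translate(last_bases))   # KRRKPN
```

The two outputs match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.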