Gene Gura_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4042 
Symbol 
ID5165929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4700105 
End bp4701391 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content50% 
IMG OID640551521 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001232759 
Protein GI148266053 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGT GGTCTACTGT TCCGTTTGGT CAAATTGCCA AAAAGATTGT AAACGGTGGA 
ACCCCGTCGA CTGATATAGA CCGTTACTGG AATGGAAACA TACCTTGGAT AACTGGAGCC
GACTTCACGC CATCTGGCAT CGGGGAATTC AGGCGCTTCG TCTCGGAGGA GGCTGTCAGG
CAGTCGGCTA CAAATGTGAT TCAACAAGGT CAATTGTTAT TGGTCACCCG CACAGGAGTC
GGAAAAATTG CCATTGCCCC ATGTGACATC GCCATCAGCC AAGACATCAC TGGAGTTTAC
GTCGATGATA ATCAGGTTGC TACATCGTTC CTTTTTCATC GAATGCGTCA GGGAGTGGAA
GACCTCAAAA AACTGAACCA AGGAACGAGC ATTAATGGGA TAATCCGCTC CGACCTCGTT
GCTTACTTGG TGGAGTTGCC AGCACTTCCT CAGCAGCGCC GCATCGCTGA AATCCTCTCA
ACACTGGACG AAACAATTGA GCAGACCGAG GTGCTGATTG CGAAGATGCA GCAGGTCAAG
GCTGGGCTGA TGCACGACCT GTTCACCCGT GGCGTCACCC CCGACGGCCA CCTTCGTCCC
ACACGCGAAC ATGCGCCCGG CCTCTACAAA GAATCTCCGC TTGGGTGGAT TCCGAAGGAG
TGGGAGGTCG AAAGACTGGG AAACATCTTA CGTAAATGCG GTGGATACCT TCAGACTGGG
CCTTTTGGCA GTCAGCTCCA TGCTCATGAA TATCAGGCCG AAGGTGTTCC AGTCGTGATG
CCCCAAGACA TCAACAATGG ATTGATTGGC ACAGAGAATA TCGCCCGAAT TCACGAGGCA
CGTGCCAATG ATTTAGCGCG GCATCGAATG AGTCTTGGTG ACATGGTAAT TGCCAGACGA
GGCGATCTTT CACGTGCAGC AGCAATCAGA GAGTCAGAGC AGGGTTGGGT TTGTGGGACA
GGGTGCTTCT TACTACGCTT AGGACAGAGC GCCTTGACGG CAGACTTTGC AGCTCAAGTT
TACCGACAAG ATTTTGTGCA GCGGCAGATC GTAGGCAGAG CCGTTGGAAC CACAATGCCG
AGTTTAAACA ACTCGGTTAT GGAAGGGTTG TTTTTTCCTT TTTGTGATTT AGATGAACAG
GTGCGAATTG TTGAGCGGCT GGAATGGATG GAAATGAATA TTTGTGCTCT TAATGAAAGT
CAGTCCGTGA ATCGACTAAT CAAACGCGGC CTCATGCACG ACCTCATGAC AGGTAACGTG
CAAGTGTTTG AACGTACCGA AATTTAA
 
Protein sequence
MSEWSTVPFG QIAKKIVNGG TPSTDIDRYW NGNIPWITGA DFTPSGIGEF RRFVSEEAVR 
QSATNVIQQG QLLLVTRTGV GKIAIAPCDI AISQDITGVY VDDNQVATSF LFHRMRQGVE
DLKKLNQGTS INGIIRSDLV AYLVELPALP QQRRIAEILS TLDETIEQTE VLIAKMQQVK
AGLMHDLFTR GVTPDGHLRP TREHAPGLYK ESPLGWIPKE WEVERLGNIL RKCGGYLQTG
PFGSQLHAHE YQAEGVPVVM PQDINNGLIG TENIARIHEA RANDLARHRM SLGDMVIARR
GDLSRAAAIR ESEQGWVCGT GCFLLRLGQS ALTADFAAQV YRQDFVQRQI VGRAVGTTMP
SLNNSVMEGL FFPFCDLDEQ VRIVERLEWM EMNICALNES QSVNRLIKRG LMHDLMTGNV
QVFERTEI