Gene Gura_3642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3642 
Symbol 
ID5164255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4271395 
End bp4273029 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content53% 
IMG OID640551126 
Producthypothetical protein 
Protein accessionYP_001232368 
Protein GI148265662 
COG category[L] Replication, recombination and repair 
COG ID[COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000393022 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAG AAGCAAAACA GGAGAAGGGA TTAACCTTCC AGATAAGCCC AGACGGCGGC 
AAGTTACTCG CCACGTACGA GCCCGTCGCA CAAAAAGTCC CAATCGACAC TGACTGGGTC
TGGCAGGCGC TTGATGCTCA AAATTTATCA GACCTGTTCA TTCTTGATGA CGCTCTTTCC
AATCTGGTAA AGAGGTGCGC CGTTGCCGCA GACCGATTCA CGATGCAGAT CGGGGAACGC
AGGGATGGCA CGCTGGCATT GACCGTTGCT CCAGATCTGA TGTCGGCGTA TATCACCATC
ACATCTGCAT ACGGCGGCAA TGCCGTCACC TCCGAGCAGA TACTCCAGGC TCTCCAGGAA
CAAAAAATTG TCAGCGGGAT TCTCCATGAT GAAATAGAAA AGGCTGTTGG TGAAAGGGAG
GTCCTGAAAA GGGAAATTGC CAAAGGCCGT TCGCCGCAAC CAGGAGAAGA TTCCCAGTTT
ATCAGCCTGA TCCCGGAAAT GAGGGAAAGA TGTCCCCTGG CAGACGACAC GGACAACGTT
GATTACCGCA ATCTGGGCGG CATCGTCAGC GTCAAATCGG GCGACCCTTT GATGCGGCGC
TATCCGGCGA CCAAAGGGAC GCCTGGCGAA AACATTCTGG GAACTCCGTT ACCAACAACC
GACGGCAACG ACATAGCATT TACTCCGAAC CTCAGTGGCA CCGTCTTCGC GGAAAACGAC
AGTGATCTGC TTCTGGCAGC CATTTCCGGG CTACCCGTCC AGGTGGATCA CGGCATCATT
GTCGAACCGG TCATCAATCT CAAGAATGTC GACCTCTCTT CCGGTAATCT GCACTTCGAA
GGCACGGTAA ACATCGCCGG TGATGTAAAA GCCGGGATGG AAGTAAAAGC AACGGGTGAC
ATCATTATCG GCGGCGTCGC CGAAGCCGCA AAGATCGAAG CCGGCGGCAA CATCGAAATA
AAAGGGGGGG TAATCGGCCA GAGAGAAGTG AGAAACCAAA AAGGCGAACT GAACCCCGAC
ATTTCATACG TTCATGCCGG CGGCTCGGTC ACCGCGCAAT TTGTGGAGAA CGCCTGCATT
ATCGCCGGCC GCGACATCAA TATCCGTGAA GTGGCTATGA AGAGCGAACT CACCGCCGGA
AACGAAGTAA TAGTCGGAGA GCAAGGAATG AAAAAGGGGC ACATCATCGG CGGCGTCTGT
CGAGCCACCA CCCTTGTCCA TGCCATCATT GCCGGCTCCC CCGCCAACGT CAGCACGAGA
ATTGAAGTCG GTGTCGATCC GTCCATCAGC GAAAAGCTCT CCATTGTCAA GCAGCAACTG
GAGGAAAAGG AAAAACGACA GGAAGAAACC GCCAAGACAT TGGCATACAT CCGCGACAAC
CCGGCCAAGG TTGATGCAGG AATGGCCAGA CTGAAGGAAC GGGTCTACAA CATCCAGCAA
GCAGAAATCA CAGAACTGAC CGGACAAAAA AAACGCCTGC AAAAACGGCT TGAACTGGTC
AACAACGCCA GAATAGAAAT AGAACGAACC GTATACTTCG GTGTCCATCT CATGGTTGGG
GACAAGACCC TCCTGATCGA AGATGACCTG GAAAGCAAAA CTTTCACTCG CGGAGAAGAA
GGAATCGCAT ACTGA
 
Protein sequence
MAEEAKQEKG LTFQISPDGG KLLATYEPVA QKVPIDTDWV WQALDAQNLS DLFILDDALS 
NLVKRCAVAA DRFTMQIGER RDGTLALTVA PDLMSAYITI TSAYGGNAVT SEQILQALQE
QKIVSGILHD EIEKAVGERE VLKREIAKGR SPQPGEDSQF ISLIPEMRER CPLADDTDNV
DYRNLGGIVS VKSGDPLMRR YPATKGTPGE NILGTPLPTT DGNDIAFTPN LSGTVFAEND
SDLLLAAISG LPVQVDHGII VEPVINLKNV DLSSGNLHFE GTVNIAGDVK AGMEVKATGD
IIIGGVAEAA KIEAGGNIEI KGGVIGQREV RNQKGELNPD ISYVHAGGSV TAQFVENACI
IAGRDINIRE VAMKSELTAG NEVIVGEQGM KKGHIIGGVC RATTLVHAII AGSPANVSTR
IEVGVDPSIS EKLSIVKQQL EEKEKRQEET AKTLAYIRDN PAKVDAGMAR LKERVYNIQQ
AEITELTGQK KRLQKRLELV NNARIEIERT VYFGVHLMVG DKTLLIEDDL ESKTFTRGEE
GIAY