Gene GSU0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0043 
Symbol 
ID2688550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp52000 
End bp53241 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content63% 
IMG OID637124708 
ProductImpB/MucB/SamB family protein 
Protein accessionNP_951105 
Protein GI39995154 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGGGG GAGAGCCCAG GCAGCGAACC ATCCTCCACA TCGACATGAA CGCCTTTTTC 
GCCAGTGTCG AACAGCAGGC GAACCCGAGC CTCCAGGGAA AACCCATCGC CGTTGTCGGC
TCCGGCCGCA CCGTGGTCAC CACCGCCTCC TATGAGGCCC GGGCCTTCGG CGTCAAGACC
GGCATGAACA AGTGGGAAGC GCTCCAGGCC TGCCCCCACC TCATCCTGGT CGTGGGTGAC
AACCGCAAAT ACACCCACAC CTCCACCCAG ATCAACCGGA TCTTCCGGGA CTTCACCCCG
GAGGTGGAAA CCTTCTCCAT CGACGAAGCC TTCCTCGACG TCACCGGCTC CCTGGCGCTC
TTCGGCTCCG CCGAAACCAT CGCCTGCCGG ATCAAGGCCC TGATCCGCCA TCGTTTCGGC
CTGACCTGCT CCATCGGCAT CGCCCCCAAC AAGCTCCTCG CCAAACTCGC CTCTGACATG
AAAAAGCCCG ACGGCCTCAC CATCATCCGC CCGGAAGAGG TGACCCGCCT CATGGAGATC
ATCCCCATCC AGGACCTCTG CGGCATCGGC GTCAAGACCA GAAAACAGCT CAACAGCCTC
GGCATCCAGA CCTGTGGCGA GCTGGGGCGC TTCCCGGTGG AGATCTTGCG GCGCACATTC
GGGGTGATCG GCGACCGGCT CCACCTCATG GGCAAGGGGA TCGACGATTC CCCCGTGGTC
CCCGTCGAGG AGGCCGAAGA GGTAAAGAGC GTCGGCCACT CCATGACCCT GGACAGGGAC
CTCACCGCTC GACGGGACAT CCTCAAATAC CTCCTCCAGC TCTCCGAGAT GGTGGGCCGC
CGGGCACGGC GCTACGGCGT TGCCGGCAAG ACGGTCCATC TCACCATCCG CTATGCCGAC
TTCACCACCG TGGGCAAGCA GCAGACCCGG AACCAGGCCA CTAACAGCAC AGAGGAAATT
TACGCCGAAG CGGTGAAGAT CCTCGACACC TTTGAGCTGC TGCAGCCGGT GCGTCTTCTG
GGGGTGCGGA TCACGAACCT GTGCTACCAG CGGGAACAGT TGCCGCTCTT CGAGAAGGAA
CGGAGGAAGG CCCTTGCCAC CGGGGCCATG GACGCGGTGA ACAACAGGTA TGGCGACTTC
TCTGTCACCT TCGGGAGCCT CCTTGATGAA GAGGAGAAGG GGAGCTTCGT CATCTCCCCG
GCCTGGCGGC CGGAGGGGAT CAGGAATGTG GAGGTGAAGT GA
 
Protein sequence
MSGGEPRQRT ILHIDMNAFF ASVEQQANPS LQGKPIAVVG SGRTVVTTAS YEARAFGVKT 
GMNKWEALQA CPHLILVVGD NRKYTHTSTQ INRIFRDFTP EVETFSIDEA FLDVTGSLAL
FGSAETIACR IKALIRHRFG LTCSIGIAPN KLLAKLASDM KKPDGLTIIR PEEVTRLMEI
IPIQDLCGIG VKTRKQLNSL GIQTCGELGR FPVEILRRTF GVIGDRLHLM GKGIDDSPVV
PVEEAEEVKS VGHSMTLDRD LTARRDILKY LLQLSEMVGR RARRYGVAGK TVHLTIRYAD
FTTVGKQQTR NQATNSTEEI YAEAVKILDT FELLQPVRLL GVRITNLCYQ REQLPLFEKE
RRKALATGAM DAVNNRYGDF SVTFGSLLDE EEKGSFVISP AWRPEGIRNV EVK