Gene GSU1616 details


Gene Information

Locus tag: GSU1616
Symbol:
ID: 2687433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1769602
End bp: 1770831
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 55%
IMG OID: 637126296
Product: ImpB/MucB/SamB family protein
Protein accession: NP_952667
Protein GI: 39996716
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
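
The start, end, gene length and protein length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive (which is consistent with the listed values) and that the CDS ends in a single untranslated stop codon. The function names `cds_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = cds_length(1769602, 1770831)
print(gene_len, protein_length(gene_len))  # 1230 409
```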


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATC ATCGAACCAT TCTCCATCTG GACATGAACG CCTTTTTTGC CTCGGTTGAG 
CAACAAGCAG ACCCATCCCT CCAAGGAAAG CCAGTTGCGG TCATCGGGTC AGGCGCGCGC
ACCATCATCA CTACCGCATC CTACGAGGCA CGCGCCCACG GGGTCGGCAC GGGCATGACA
ACCCACGAAG CAAAACGGCT CTGCCCGGAA TTGATCCTCG TTATCGGCAA CAACCGGGAA
TACACCCGTA TTTCAAAACT GATCTTCGAA ATGCTCGGGC AATTCACCCC CCTTGTGGAA
GTCTTTTCCA TTGACGAAGC CTTTCTCGAC TTGACCGGTT CGCTTCGACT TTTCGGCCGC
GCCGACGTGA TTGCCCGCGA GATCAAAGCC CGCATAAGGG ACCGATTCGG CCTCACCTGC
TCCATCGGCA TCGCTCCCAA CAAGCTTTTG GCAAAGCTGG CCTCGGACAT GCAAAAGCCC
GATGGTCTCA CCATCATCCC CCCCGAAAAC GTGGCTTCAC TCATGAAAAC GACCCCGATC
CGCAGCCTCT GCGGAATAGG CCCATCCACT GAACAACGAC TGAAATACCT CGGCATTTCA
ACCTGTAACG ATCTGGGACA CTTCAGCTCC AAGATACTTA CCCGACATTT CGGTGCCATG
GGAGCACGAC TCAAACAGAT GGGACAGGGC ATTGACGACT CTCCCGTAAT ACCGACCACA
ACGGAAGAAG AAGTTAAAAG CGTCAGCCAC AGTACAACCC TGGACCACGA CCTCACCAAC
AAGGACGACA TACATAAGTA CCTGCTTATG TTATCGGAGA TGGTAGGACG CCGCGCGCGC
CGCTACAAAA CCAAAGGGAA AACAGTTACC CTAACAATAC GGTATGCCGA TTTCACCACG
TTCAGCCGCC AGGAAACCCT CCCCTCCCCA TTGAATCGAA GCGGCGACAT TTATCGAGCC
GTCACTGCAC TGCTCGACAC ACTCCAATTA ACGCAACCAA TACGACTCCT GGGCGTTGCC
ATCAGCAACC TTGTACATGC CTCGGAACAG CTCCCCCTCT TCCCGGAAGA ACGGCGGCTC
ACAACCCTCT CGCAAGCCAT GGATCAAGTC AATGACCGGT TCGGAGAAAC CACCATCACC
TTTGGCACCT TGCTGGGACA AGACAACTGC TCGCACGTTA TCGCCCCTGC CTGGAGACCC
GCTGGCATAC GTGACGTAAA AGTAAAATAA
 
Protein sequence
MSNHRTILHL DMNAFFASVE QQADPSLQGK PVAVIGSGAR TIITTASYEA RAHGVGTGMT 
THEAKRLCPE LILVIGNNRE YTRISKLIFE MLGQFTPLVE VFSIDEAFLD LTGSLRLFGR
ADVIAREIKA RIRDRFGLTC SIGIAPNKLL AKLASDMQKP DGLTIIPPEN VASLMKTTPI
RSLCGIGPST EQRLKYLGIS TCNDLGHFSS KILTRHFGAM GARLKQMGQG IDDSPVIPTT
TEEEVKSVSH STTLDHDLTN KDDIHKYLLM LSEMVGRRAR RYKTKGKTVT LTIRYADFTT
FSRQETLPSP LNRSGDIYRA VTALLDTLQL TQPIRLLGVA ISNLVHASEQ LPLFPEERRL
TTLSQAMDQV NDRFGETTIT FGTLLGQDNC SHVIAPAWRP AGIRDVKVK
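
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship on the first and last sequence rows, and includes a helper for the GC fraction. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs mainly in which codons may act as starts). The names `clean`, `translate` and `gc_fraction` are hypothetical helpers written for this example.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence.
print(translate("ATGAGCAATC ATCGAACCAT TCTCCATCTG"))  # MSNHRTILHL
# Last row of the gene sequence, ending in the TAA stop codon.
print(translate("GCTGGCATAC GTGACGTAAA AGTAAAATAA"))  # AGIRDVKVK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.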