Gene GSU1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1981 
Symbol 
ID2688163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2171409 
End bp2172638 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content58% 
IMG OID637126672 
Producthypothetical protein 
Protein accessionNP_953030 
Protein GI39997079 
COG category 
COG ID 
TIGRFAM ID[TIGR03016] uncharacterized protein, PEP-CTERM system associated 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGG CAGCCCCGCT TCTCATCTGC TGTGCATGCG CCGCGCTCGT TCCGTTTCCA 
GCCAATGCCG CTGATTTCCG GTTTACCCCC AACATAACCG TGAGCGAGGA GTTCACCGAC
AACGTCTTCG AGACGGCCGA GGGGAAGCGC TACGACTTCA TTACCCGCCT GCTTCCGGGC
CTTTCACTTG ATTATAAGGC CCCAGTTCTG GACCTCTCCG TGGCGTACAA TTACGACTAC
CGCTACTATG CACGCAACAG CCGCAGCGAC GACACCACCC ACAACCTTGA CGCCCGTGGC
CTGGCGCGAA TTGTCGACGA GTTCCTCTTT CTGGAAGCCA GCGATACCTA CAAACGTGTC
TCCCTCGACG TTTCCCGCGA TACCAGCAAC GAAAGCCTGT TCCGCAACCA GTCGGACCAG
AACATCGTCA CCGCTTCGCC CTATTTCGTC ATCAGCACCA TCCCCCACTA CACCATTAAG
GGTGGGTATC GCTACACCAA TACCTGGTAC AAGGATCCAT CCGGCATCGA CAAGACCGAG
CACCGCGCCT TTGCCGACCT CAGCTATGAG GTGACCAAGC AGCTTTCCCT GACCGGCACG
TATTCCTACG TGATCGAGGA CACTGCGGAA TCGGACCTGA ACCGCCACGA GGCCATGGCG
GGCGGACGCT ATGAGTACGC CGACAAGAGC TTCGTCTTTG CCAATGGCGG TGCTTCCCGC
ATTTCCTACC GGGGAGGGGA CACCTTCACC AATCCCATCT GGAATGCCGG GCTTACTCAT
GCCTTCGATT CCTTCACCGT TACGCTGTCC ACCGGCGTCC AGTACAGCGA AGACCCCCTG
CGCGCCTCCA CGGAGGAGAC CTTTTACTCG GCCGTATTCG ATATGCCCCT GAAGCGTGGG
GCGCTGAACC TGAACGCTTC CTACTCTGAC TTCGTCAATA CCGCAACCGA TGAGCGGGAA
ACCAGACGGT ACGGCGGTGG ATTCACCCTG CACCATGAGC TCACTCAACG TCTTACCGGC
ACCCTCGGCT TTGCCGCCGA ACGTTACGAA CAGAACCTGC TGGATGCCTA TACCCGCAAA
TTCTTCGTGG ATGCCGGCCT TCGCTACGAA CTGGGCGAAG GGTTTTCCCT GGGGCTTACC
TACCGCTACA TCGATTACCA CTCCCCCCGG ATCGTAGCAG ACAACTACAC CGTAAACCGG
GCGATGGTGG AGATCAGGAA GGTCTTCTAG
 
Protein sequence
MKQAAPLLIC CACAALVPFP ANAADFRFTP NITVSEEFTD NVFETAEGKR YDFITRLLPG 
LSLDYKAPVL DLSVAYNYDY RYYARNSRSD DTTHNLDARG LARIVDEFLF LEASDTYKRV
SLDVSRDTSN ESLFRNQSDQ NIVTASPYFV ISTIPHYTIK GGYRYTNTWY KDPSGIDKTE
HRAFADLSYE VTKQLSLTGT YSYVIEDTAE SDLNRHEAMA GGRYEYADKS FVFANGGASR
ISYRGGDTFT NPIWNAGLTH AFDSFTVTLS TGVQYSEDPL RASTEETFYS AVFDMPLKRG
ALNLNASYSD FVNTATDERE TRRYGGGFTL HHELTQRLTG TLGFAAERYE QNLLDAYTRK
FFVDAGLRYE LGEGFSLGLT YRYIDYHSPR IVADNYTVNR AMVEIRKVF