Gene GSU2075 details

Gene Information

Locus tag: GSU2075
Symbol:
ID: 2687924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2279884
End bp: 2281341
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 63%
IMG OID: 637126766
Product: subtilisin
Protein accession: NP_953124
Protein GI: 39997173
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
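
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 2279884
end_bp = 2281341

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1458 485
```

Under these assumptions the computed values agree with the reported gene length of 1458 bp and protein length of 485 aa.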


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.734666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGTATC TGCTCGCCGT TACGGCTCTT TTTATCCTGC TGCCTCCTCT GGGTGACGCT 
TTTGCCGCCG ACAGGAAGGT CATTGTAGGA TTTCGCTCCA CGGTGGAAAA AAGGGACGTT
CGCCACAAAG AGAAGGTCTA TCGGCACGGC GGTCGCGTCA AGAGGACGCA CTCTGCGGTA
AACGCCATAT CGGCAACCCT GTCCGAAGAA GAGATCGAGC GCCTGAAGAA AGATCCCGAC
GTTGCCTACG TGGAGACGGA CTTCGTGCTT TCGTCCATTG AACCGGCCGC GGCTTCGCCG
GAGGAGTATG CCGCAGCGTG GGGCGCGCAG CACATCGGTG CCGACCAGGT TGCGGCGGCC
GGCATCACCG GCGCGGGGGT CCGGGTGGCA GTGCTCGATA CGGGCATTGA TTACACGCAT
CCCGATTTGA AGGACAACTA CAAGGGGGGG TACAACTTTG TAGCAGACAA CAACGATCCC
ATGGACGACG CATACTCCCT CAGCCATGGC ACCCACGTGG CCGGGATCAT CGCCGCCCGC
AACAACGGTA CCGGTGTGGT CGGCGTCGCG CCCGCAGCGG AGCTCTATGC GGTCAAGGTG
CTTAACGGCG GCCTCGGCGG AGAGTTGAGC GACATTATCG CCGGCATCGA GTGGGCCATC
GAGAACCGGA TGCAGGTCGT CAACATGAGC TTCGGCAGCA TGGAGTTCTC CCAGGCGCTC
AAGGATGTCT GCGATCTGGC CTATCGATCG GGAATCGTGC TGGTGGCTTC GGCCGGCAAT
TTCTCGCCGG GGGCCGTACT CTATCCCGCC GCTTTCGATT CGGTCGTGGC GGTTTCCGCC
ACCTACCAGG ACGACACGCT TGGAACGTTT TCCAGTTACG GTCCCCAGGT CGAATTGGCC
GCACCGGGGC ACAATATCTA TTCCACGGCG ATCGGCGGCG GCTACCGCAT CAACTTCGGC
ACATCGCAGG CCGCACCCCA TGTCACCGGT GCGGCGGCGC TTCTCATCTC GGCCGGCACC
ACCGACACCA ACGGTAACCG CTCCGTTGCC GACGAGGTCA GGCAACGACT TGCGGCAGCC
GCCCGGGACC TGGGTGAAAT GGGCAGGGAC ATCTACTATG GTTACGGCCT CGTTGACGTA
GCCAAGGCCG TTCTGTCGCC GCCGAACATC GAGACGGTGG TCACCACGCC GCGGGGGAAA
CGGTGTGCAT CTGCTGCAGC CCTTGATCTG GCGAACTCGA CCTACCGGCT GGACATTACG
GGAGCGACGT TGCAGGCGCT TGAAGTCCGC GTCGGGAGCG CCGACGGGCC TCTTGTGAGC
TTTATCCGCT TCCGGCGTGG GACTGAAGGG GCGGTATCGT TCAGCTACAC GGCATCCGGC
ACTGTCAGGC TGGTGCTGAT CCCCCACGGC AAACCGGGAA CATCGGCGCG GGTGACGGCC
GTTCCGGAGC AGCTGTGA
 
Protein sequence
MRYLLAVTAL FILLPPLGDA FAADRKVIVG FRSTVEKRDV RHKEKVYRHG GRVKRTHSAV 
NAISATLSEE EIERLKKDPD VAYVETDFVL SSIEPAAASP EEYAAAWGAQ HIGADQVAAA
GITGAGVRVA VLDTGIDYTH PDLKDNYKGG YNFVADNNDP MDDAYSLSHG THVAGIIAAR
NNGTGVVGVA PAAELYAVKV LNGGLGGELS DIIAGIEWAI ENRMQVVNMS FGSMEFSQAL
KDVCDLAYRS GIVLVASAGN FSPGAVLYPA AFDSVVAVSA TYQDDTLGTF SSYGPQVELA
APGHNIYSTA IGGGYRINFG TSQAAPHVTG AAALLISAGT TDTNGNRSVA DEVRQRLAAA
ARDLGEMGRD IYYGYGLVDV AKAVLSPPNI ETVVTTPRGK RCASAAALDL ANSTYRLDIT
GATLQALEVR VGSADGPLVS FIRFRRGTEG AVSFSYTASG TVRLVLIPHG KPGTSARVTA
VPEQL
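
The GC content and protein sequence reported in this record can in principle be recomputed from the gene sequence. The sketch below is one possible way to do that in plain Python; it is not the method the database uses. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11 for internal codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGGTATC TGCTCGCCGT TACGGCTCTT"
print(translate(first_blocks))  # MRYLLAVTAL
```

Passing the full gene sequence to `gc_content` and `translate` should allow a comparison with the 63% GC content and the 485 aa protein sequence listed in the record.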