Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2075 |
Symbol | |
ID | 2687924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2279884 |
End bp | 2281341 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637126766 |
Product | subtilisin |
Protein accession | NP_953124 |
Protein GI | 39997173 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.734666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTATC TGCTCGCCGT TACGGCTCTT TTTATCCTGC TGCCTCCTCT GGGTGACGCT TTTGCCGCCG ACAGGAAGGT CATTGTAGGA TTTCGCTCCA CGGTGGAAAA AAGGGACGTT CGCCACAAAG AGAAGGTCTA TCGGCACGGC GGTCGCGTCA AGAGGACGCA CTCTGCGGTA AACGCCATAT CGGCAACCCT GTCCGAAGAA GAGATCGAGC GCCTGAAGAA AGATCCCGAC GTTGCCTACG TGGAGACGGA CTTCGTGCTT TCGTCCATTG AACCGGCCGC GGCTTCGCCG GAGGAGTATG CCGCAGCGTG GGGCGCGCAG CACATCGGTG CCGACCAGGT TGCGGCGGCC GGCATCACCG GCGCGGGGGT CCGGGTGGCA GTGCTCGATA CGGGCATTGA TTACACGCAT CCCGATTTGA AGGACAACTA CAAGGGGGGG TACAACTTTG TAGCAGACAA CAACGATCCC ATGGACGACG CATACTCCCT CAGCCATGGC ACCCACGTGG CCGGGATCAT CGCCGCCCGC AACAACGGTA CCGGTGTGGT CGGCGTCGCG CCCGCAGCGG AGCTCTATGC GGTCAAGGTG CTTAACGGCG GCCTCGGCGG AGAGTTGAGC GACATTATCG CCGGCATCGA GTGGGCCATC GAGAACCGGA TGCAGGTCGT CAACATGAGC TTCGGCAGCA TGGAGTTCTC CCAGGCGCTC AAGGATGTCT GCGATCTGGC CTATCGATCG GGAATCGTGC TGGTGGCTTC GGCCGGCAAT TTCTCGCCGG GGGCCGTACT CTATCCCGCC GCTTTCGATT CGGTCGTGGC GGTTTCCGCC ACCTACCAGG ACGACACGCT TGGAACGTTT TCCAGTTACG GTCCCCAGGT CGAATTGGCC GCACCGGGGC ACAATATCTA TTCCACGGCG ATCGGCGGCG GCTACCGCAT CAACTTCGGC ACATCGCAGG CCGCACCCCA TGTCACCGGT GCGGCGGCGC TTCTCATCTC GGCCGGCACC ACCGACACCA ACGGTAACCG CTCCGTTGCC GACGAGGTCA GGCAACGACT TGCGGCAGCC GCCCGGGACC TGGGTGAAAT GGGCAGGGAC ATCTACTATG GTTACGGCCT CGTTGACGTA GCCAAGGCCG TTCTGTCGCC GCCGAACATC GAGACGGTGG TCACCACGCC GCGGGGGAAA CGGTGTGCAT CTGCTGCAGC CCTTGATCTG GCGAACTCGA CCTACCGGCT GGACATTACG GGAGCGACGT TGCAGGCGCT TGAAGTCCGC GTCGGGAGCG CCGACGGGCC TCTTGTGAGC TTTATCCGCT TCCGGCGTGG GACTGAAGGG GCGGTATCGT TCAGCTACAC GGCATCCGGC ACTGTCAGGC TGGTGCTGAT CCCCCACGGC AAACCGGGAA CATCGGCGCG GGTGACGGCC GTTCCGGAGC AGCTGTGA
|
Protein sequence | MRYLLAVTAL FILLPPLGDA FAADRKVIVG FRSTVEKRDV RHKEKVYRHG GRVKRTHSAV NAISATLSEE EIERLKKDPD VAYVETDFVL SSIEPAAASP EEYAAAWGAQ HIGADQVAAA GITGAGVRVA VLDTGIDYTH PDLKDNYKGG YNFVADNNDP MDDAYSLSHG THVAGIIAAR NNGTGVVGVA PAAELYAVKV LNGGLGGELS DIIAGIEWAI ENRMQVVNMS FGSMEFSQAL KDVCDLAYRS GIVLVASAGN FSPGAVLYPA AFDSVVAVSA TYQDDTLGTF SSYGPQVELA APGHNIYSTA IGGGYRINFG TSQAAPHVTG AAALLISAGT TDTNGNRSVA DEVRQRLAAA ARDLGEMGRD IYYGYGLVDV AKAVLSPPNI ETVVTTPRGK RCASAAALDL ANSTYRLDIT GATLQALEVR VGSADGPLVS FIRFRRGTEG AVSFSYTASG TVRLVLIPHG KPGTSARVTA VPEQL
|
| |