Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0054 |
Symbol | |
ID | 2688363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 70330 |
End bp | 71949 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637124719 |
Product | hypothetical protein |
Protein accession | NP_951116 |
Protein GI | 39995165 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02165] CRISPR-associated protein, GSU0054 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATGT ACTTTGTATT GACAATCGCC TTTCTCGACG GCCGTTTCCA CGGCAGGCGG GACGGTGACG AGCCGGAATG GCCTCCTTCT CCGTTAAGGG TTTTTCAAGC ATTGGTGGCA GCATCAGCAC GGATGAACGG CGGGGCATTG TCGTCAGATG GCAGTTCTGC GTTGCAGTGG CTTCAGGCGC AGCCTGCTCC TGCGATTGTC GCACCGTCAG GCATCCTTTC GGCTTCGCCA TATCGACTTT CAGTGCCGAA CAACGCCATG GACATCGTTG CGAGAGCATG GGGTCGAGGT AATGAAACCA ACTCGGGTGA CGCCAACCCT GCGACGCACC GCACGATGAA GAGCATCCGT CCCATTCATT TGATAGACAG CAGTTCGGTG CATTACCTAT GGCGAGTAAG CGAACCGGTT GCCCCAGAGA TAGCCGATTA CGTCCATGCA ATAGTGGAGA TGGCTCAAAA CATCAACGTG TTGGGCTGGG GGATCGATAT GGTGGTGGGA AATGGCGCGA TGCTTACCGA GGAGCAGATG GAGTCTCTTC CCGGTGAGCG TTGGTTGCCA CACGCTGAGA CCGGTGTGGA CGGCCTACGG GTACCGGTCA ACGGGACTCT GGCCGATTTG CAGGCGCGGC ATGAAGGATT TCTTTCGCGG CTGGCGCACG GCATCTTCAC TCCGCCTCCG CCGTTGGCTG TCTACGACAA GATCAATTAT CGGCGGGCCA TTGATCCACC CCCGAGGGCA ATTGCCGCAT TTTCCCTGCT GAAGACAGAT GCCAGCGGAT TCCGGGCCTT CGACACGGCT AGATGGGCGC TCACCGTGGC CGGGATGACG CGCCATGCCG CACGACGGGC GGCGCAAGGC GCGGGTTGGA AAGAATCAAG GATAAACGGC TGCATCCTTG GGCACGGCGA ATCAATCGGC GATGAAAAAC ATCTCCCGAC AGGACCACAA CGTTTTGCAT ATCTCCCCGT GCCGAGTCTG GAGGCCCGAG GTGCCGGCAA GGCACCAGTG ATTGGCAGCG TACGCAGGGT GATTATCACC GCTTTTGATG GGGCGTGTGG AGATGAAATC GACTGGGCGC GTCGCGCACT TTCGGGACAG ATGCTAGAGA AGATCAAGAA AGACGAGAGC GATGATAAAG AGCACGTGGC GTTGCTTTCC CTGCTCCCCG GATCCGACAA GGTGATTCGC TCGTACCTCC GACCGTCCTC TTCCTGGGCG ACCGTTACCC CCGTCGTTCT TCCGGGGTAT GACGACCCGG CGCACTACAG ACGCCGGCTC CAGCACGTCA CTAACTCGGA TGAGCAAAAG CGGCTACTGT GGCATCTCCA TGAACGGATC GACGGTTTGC TCCGGAAAGC CATTGTGCAG GCGCAGTTTC CCGAAATACT GGCGAAGAAT GCACTGATCG AATGGCGCAA GGTTGGCTAC TGGCGTGGCG CCGATCTGGC GGATCGTTAT GGCGTGCCGG ACCACCTCAA GAAATTCCCG CGTTACCATG TCAAAATCCA GTGGCGCAAT GATTGTCAGA TGCCGGTGCG GATTGATGGC CCAATCTGCA TTGGAGGTGG GCGATTCTAC GGCCTCGGTC TTTTCGCCCC CGTTGACTGA
|
Protein sequence | MSMYFVLTIA FLDGRFHGRR DGDEPEWPPS PLRVFQALVA ASARMNGGAL SSDGSSALQW LQAQPAPAIV APSGILSASP YRLSVPNNAM DIVARAWGRG NETNSGDANP ATHRTMKSIR PIHLIDSSSV HYLWRVSEPV APEIADYVHA IVEMAQNINV LGWGIDMVVG NGAMLTEEQM ESLPGERWLP HAETGVDGLR VPVNGTLADL QARHEGFLSR LAHGIFTPPP PLAVYDKINY RRAIDPPPRA IAAFSLLKTD ASGFRAFDTA RWALTVAGMT RHAARRAAQG AGWKESRING CILGHGESIG DEKHLPTGPQ RFAYLPVPSL EARGAGKAPV IGSVRRVIIT AFDGACGDEI DWARRALSGQ MLEKIKKDES DDKEHVALLS LLPGSDKVIR SYLRPSSSWA TVTPVVLPGY DDPAHYRRRL QHVTNSDEQK RLLWHLHERI DGLLRKAIVQ AQFPEILAKN ALIEWRKVGY WRGADLADRY GVPDHLKKFP RYHVKIQWRN DCQMPVRIDG PICIGGGRFY GLGLFAPVD
|
| |