Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2164 |
Symbol | |
ID | 2685888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 2377156 |
End bp | 2380512 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637126856 |
Product | hypothetical protein |
Protein accession | NP_953213 |
Protein GI | 39997262 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.736314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTTA CCACTACCAT CAACAGCTCG TTTTCCCGCA CCCGGGGGAA TGAACTGAAC AAGGGGTACT ACCCGACCTT TCCATCTGAC GTTCAGACGG TACTTCGTAT CCTGGAGCCG TCATTCGGTT GGAAAAGCCG GCGCTACAAT GCCCTGAAAA CCATATCAGT CCTTGACCCC TGCGCGGGTG AAGGCGAATT CCTGACCACA GTGACTCGCT GGCTCAAAAG GCGCTTTGCC GCCTCCAACG GTTCACCTCA CGAAATCCTC TCCTATGCCG TTGAACTGGA TGCCGGCAGG TTCGCCAGGA TTCACGGGAC AACCCAGAAA CTCAACTCGT CGTTCTTCGA CGTGGAGCTT TCCGGCAAGT TCAATCTGAT CTTGCTGAAT CCGCCTTACA ACAAGGCCGC CGGCAACGAA CTGGTCACCT GGATGGAAAA GACAGCGCCC CTGCTGGCCT ACGAAGGAGT GATGGTTCTC ATCATTCCCG AGTATGAACT GAAAGGGAGG CTGCTGGACA TCATCCAGGG TACGTTTACC TTTGCCTACG CCTTCCAAAG CGAGGAATAC GCCCGCTTCA AGCAGGTTGT CGTATTTCTG GCAAAGAATC AGGACAACAG CAGCGAAATC TATCATCAGT ATGTCCGTTA TCAGAGTTGG CCCGCTGCCA ACCTGGGGGA TGGAGCGGTA CGGGGGAAGT ACACCGTGCC GGGCAAACGC CAGACTCTTT CCCTTGGCGG CAGCAATGGC AGCAACCGCC CGCTCATGAC GGCCCGCGAC CTCACGGAGT TTTACCGGGA GTGCGAGGAC CGGCTCGACA AGGCGGCGGG CATGGTACTG GACAGGCAGT ATCCGTCGTC TTATGACACC AGCATCCAGC CCATCTCAAC CCTGCGAACT GCGCACGCCG TCCAACTGGC AGCCATGAAC AGCCAGATCG AATCTATCAC CATCAACTGC GACTTCTTCC TGGCTAAATT CATGGTCATC ACCAAAAACG AGACCTTCAA AGATCCGGAG AACAACTGCG AGACGATGGT CTGCAAGCCC ACCGTCGAAG CATTTCTGAT GGACCGCCAC GGGTACGTGA AACCGGCCCG GGAACACGGG TTCGATTATT ACGAACTCAA CAGCCGGCTC TCATCGGTGC TTCTGCAGAA ACTCTCACGG CTCTACCAGC CCCTTCACGA AATCGGCCAG GACGAAGCAT ACCTGGTTGA TGAACTGCAG GGGATCGGTC TGCTCCCCCC GCAACGGGAA GCGGTCAAGG CCGTCATCAA AGCCTACCGC TCAGGACGCC GGGGTATCGG CATTCGTGCC AATACCGGCA CCGGGAAAAC ATGGATGGCG AAAGCGGTGA AGTACCTGAC CAATGCGAAA CGCACGGTGA TGGTATCGGA ACCGCAGCTT ATCCCCCAGT TGGCACGGGA ATACGCCAAC GAGGGGTTCA ATGTGCATGT CATCGACTCC TGGGAAGTCT TGAAGGAACT TGCCGCAACC AGGCCGCACG GTCTCTACCT GATCGCTTAC ACACGGCTCA GGATGCACCC GAAATACCGG CTATGCATCA GGAGTCGTAA AACCATCGTC AAGAAGGATG GCAATAGCTC CGTGGAATTC ACGGAGGTAT GTCCGTCCTG CCGTTCGCCT CTGACCGAGA AGATCCGCAA GGGTGACAAA CCCAAGTGCT CATATTGTGG CGAACCGCTC TTTACCTATA TTCCTGAAAA TGACCGCACA GCCATGACCT ATCGCCGCTG GATCAGCGAG ATCGAGAACA ACGGCACCGC CACGGAAGCA CGGACGCACA ACAAACAGTT GCCGTACATC CGATTCCTCA AGCGGATTCC ATTCGACCTC GCCATCTTCG ACGAAGTTCA TAACGCGGCC AACCTGATGA GCAATCAGGG GACCGCGTTC ATCCGGCTTG CCGCAACTGC TGCCAAAGTT CTCGCCCTCA CCGCCACCGT GACCAACGGT ATGGCGAAAT CAGTCTACAA CATCCTGTGG GGGCTGAATC CGAAGCAGAT GCGCGACTCC GGCTGGGAGA TGAAGTCAGC CACAGATTTC CAGACAAAGT ACGGAGCATT CAAAGAGGTC AGGAAAACCG ATGAAAAGAA CCGTCACCGG GAATCGGAAC GTGTCACCAC CTACGATACC GCCGGAATCT CGCCGGCAGC CCTGATCTAT ACGCTCCCCA ACTTCGTCAA TGTGGACAGT GACGACTTCG ATGACCTGCC ACCAGTGGAA CGGGAGGTCA TCAAGTGTCC GTCACACGCT CTGGTGGAAA CCTGTCACCG GAGCATCGAA AACATCATCG AAAAGGCAGA CCTGCCACCG GAAGACCTTC TTGCCGCCGC AGCCGTCCGC AATGCCGCTT TTCTGCGTGT CAGCGACACG TTCCGGCATT ACAACGACGA ACTGAAGCTG CGGGATGTTC CGCTCGGCAC GCTGCACCGG CTGTCGCTTC CCGACGGGGA GCTGCTGCTG GAAAAGGAAG AAAAACTCAT CGCTATCGCC CAAGGGGTGA TTGACCGGGG AGAGCGTCTA CTGGTCTATA CCGGGAACAC GCAGAAGATC GACATGCGCC CCGTCCTGAA ACGCATCCTT CGGGATAACG TCCAGGGCGC CAGCATCGAG ATCCTGCCCG ACTCGGTTGC ACCTGAAAAG CTGGTTGGAT GGTTCGAGCA GGTAACGGCT CAAGTCGTTG TCGTCTCGTT CCATCGCGTC GCCACGGGTC TCAACTTGTC CCAGTTCAAT AACCTTGTCT GGTACGACTA CACGTCCAAT ACCAGACTGG CCGAACAGGG GGACGGGAGA ATCCGTCGCG TCAACACTGC TGACATCCAC CGGGCTCAGT TCGGGGAAGT TCGTCCCGTC CGTTATTGGT ATCTCACCTC TTCAGAAGTC CAGGCGTTGC AGCTTGCCTA CACCCTGGAG AAGAGGATGG TCGCCAAGCT GGCGGAGGGG GAAACACCAG ACATCGACCC GGCGGAATGC AGCAGCAGCC AGTCGTTTTC ATCCCTGATG ACCAAGGCGC TCAAGGAGGG GAACTTCAAC TACTCCGACC CGTCGGCACT GTTGAAGAAG ATGACCCAGC ACGAGAACGC CCGCGTCAGG GGGGACAACA AGGCCGCATC GCCGGTCAGA AACAACGTCA TCCAGCTGCC GGTCACCAGA CCGGAACCCG TTCCCGTTCC ATCATCCATT GCCGTCATCC ACTGTGAAGG GGGCAAGGAA ATCACCCGGG AACTGCCGTT CGGCACCTAC CAGACCTTAC TGGATACAGG AGCGCTGGAA TTCACCCTGT TCGGCGTCTA TCTGCGCCAA CCCGCTCCCA TGCGGAAGCG GGCCTGA
|
Protein sequence | MELTTTINSS FSRTRGNELN KGYYPTFPSD VQTVLRILEP SFGWKSRRYN ALKTISVLDP CAGEGEFLTT VTRWLKRRFA ASNGSPHEIL SYAVELDAGR FARIHGTTQK LNSSFFDVEL SGKFNLILLN PPYNKAAGNE LVTWMEKTAP LLAYEGVMVL IIPEYELKGR LLDIIQGTFT FAYAFQSEEY ARFKQVVVFL AKNQDNSSEI YHQYVRYQSW PAANLGDGAV RGKYTVPGKR QTLSLGGSNG SNRPLMTARD LTEFYRECED RLDKAAGMVL DRQYPSSYDT SIQPISTLRT AHAVQLAAMN SQIESITINC DFFLAKFMVI TKNETFKDPE NNCETMVCKP TVEAFLMDRH GYVKPAREHG FDYYELNSRL SSVLLQKLSR LYQPLHEIGQ DEAYLVDELQ GIGLLPPQRE AVKAVIKAYR SGRRGIGIRA NTGTGKTWMA KAVKYLTNAK RTVMVSEPQL IPQLAREYAN EGFNVHVIDS WEVLKELAAT RPHGLYLIAY TRLRMHPKYR LCIRSRKTIV KKDGNSSVEF TEVCPSCRSP LTEKIRKGDK PKCSYCGEPL FTYIPENDRT AMTYRRWISE IENNGTATEA RTHNKQLPYI RFLKRIPFDL AIFDEVHNAA NLMSNQGTAF IRLAATAAKV LALTATVTNG MAKSVYNILW GLNPKQMRDS GWEMKSATDF QTKYGAFKEV RKTDEKNRHR ESERVTTYDT AGISPAALIY TLPNFVNVDS DDFDDLPPVE REVIKCPSHA LVETCHRSIE NIIEKADLPP EDLLAAAAVR NAAFLRVSDT FRHYNDELKL RDVPLGTLHR LSLPDGELLL EKEEKLIAIA QGVIDRGERL LVYTGNTQKI DMRPVLKRIL RDNVQGASIE ILPDSVAPEK LVGWFEQVTA QVVVVSFHRV ATGLNLSQFN NLVWYDYTSN TRLAEQGDGR IRRVNTADIH RAQFGEVRPV RYWYLTSSEV QALQLAYTLE KRMVAKLAEG ETPDIDPAEC SSSQSFSSLM TKALKEGNFN YSDPSALLKK MTQHENARVR GDNKAASPVR NNVIQLPVTR PEPVPVPSSI AVIHCEGGKE ITRELPFGTY QTLLDTGALE FTLFGVYLRQ PAPMRKRA
|
| |