Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0970 |
Symbol | |
ID | 2687553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1044984 |
End bp | 1047935 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637125640 |
Product | hypothetical protein |
Protein accession | NP_952024 |
Protein GI | 39996073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.351823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA AACGATATCG CTCAGACACG CGACCGGCCG CATGCGCCAT CCGCTTGCTT GCCGTGCTGG TTCTGCTGGC CGGTCTGGCG GCCCGGGCCG AAGCCGGGCC GGCCCTTCCG GTCATGGTCA AGGATATCAA CACCGCATCG GTACCCGTCT CGTCCAGCCC CTCCGGCATG ACGGCCAATG GCGGCATCCT CTTCTTCAGT GCCGGCGACG GCGCCAACGG CCCGGAACTC TGGAAGAGCG ACGGTTCCGC GGAAGGGACG GTGCTCGTCA AAGATATCAA TGCCGGGCCC GGGCCGGGCA CGCCTCAGAA TTTCTCCGTC ATGAACGGGA TCACCTATTT TTCCGCCATG GACAGCTTCC GCGGCGTGGA ACTCTGGAAA AGCGACGGCA CGGCTGCTGG GACGGTCATT GTTAAGGATA TCTACCAGGG GGGCGAATCA TCCAACCCCC TGGAGTTGAC AGTGGCAGGG AACACGCTTT TCTTTTCCGC CGATCATCCC GTCTATGGCA AGGAACTCTG GAAGAGCGAC GGCACCGCCG AGGGAACCGT TCTCGTGGCC GATATCGCTG CGGAGGCGAG TTCGACGCCC CAGTGGCTGA GGGCGGTCAA CGGTACGCTC TTCTTCGCAG CCGACGACGG TCTCCATGGC CGGGAGCTCT GGAAGAGCGA CGGGACGCCG GAAGGCACGG TGATGGTCAA GGATATCAAC CCCTTCGGCG GGTCCGATCC GGGCGAGATG GCGGTGTCCG GCGGCATCCT CTACTTTACC GCCGATGACG GTGAAAATGG CCATGAGCTC TGGAGGAGCG ACGGCACCGC CGAAGGGACG TATCTGGTCG CCGATATAGC CCCCGGCGAA GAGAGTTCCT ACCCCTTCGA GCCGGTGGGC ATCAATGGCC TGCTCTATTT TACGGCCAAT GACGGCTACA CCGGGTACGA ACTCTGGCAG AGCGACGGCA CCCCCGAGGG GACGACGCTG GTGAAGGATA TCAATCCGGA CGGCGAAGAC TCAATGCCCT GGGGCATAGT GGGCATGGAC AGGTACGTCT ACTTCGCGGC CGATGACGGC GTCAACGGGT ATGAACTCTG GCGCACCGAC GGCACCATGG GGGGGACGGA GATGGTTGCC GACATCCAGC CCGGGATGGG CGGTTCCATG TACAGCTCTC CCCGGCTGGT GAACGGCATG CTCCTCTTTG CCGCCGACGA CGGAGAGCAC GGCATCGAGA TATGGAAAAG CGACGGCACT GCCGAGGGCA CCCTCATGGT CAGGGATATC ATCCCGGACG CCATGTCGTG GCCATCCGAG CTCATGGTGC ATAACGGTAC ACTCTACTTC GCAGCCGACG ACGGCGTAAA CGGCACGGAG CTCTGGAAGA GCGACGGAAC GGCCGAGGGC ACGGTGCTGG TGCGGAACAT TGCACCCGAG ACGGCCAGCA GTCTTCCCTA CCAACTGGCC GTGATGGGGA CCACGGTATT CTTTGCCGCG GCCGATACCG ATCTTGACTT TGATGTCTGG AAGAGCGACG GTACTGCTGA TGGTACGGTG CTCGTCAAGG AGATCAATCC CGAAGGGTGG GCCTATCTGG ACCGGTTGAT GGTCGTTGGT GATACCCTTT ACTTCCTGGC CGAGGACAAC TATGGAGAGG CCAGCCATGG CATCGAACTC TGGAAGAGCG ACGGCACCGC CGAAGGCACC AGGATGATCA AGGACATCAA CCCCGGGCCC CAGGGGATAT TCTTTCCCGG CAACCCCAAT TATCCCTTCT CCATGGCCGC CGTCGGCACT ACGGTGTATT TCCCCGGTTT TACCGCGGGC AATGGTCATG AACTCTGGAA GAGCGATGGT ACCGCCGAAG GGACGGTCCT CGTGAAGGAT ATCAATCCCG TTTTCGATTT CTCCTCATTT CCCGACAGTT TTACCGCCAT GAACGGAGCG GTCTATTTCG TGGCCGATGA CGGTACGCAT GGCGCCGAGC TCTGGAAGAG CGACGGTACC GCCGACGGCA CCCGGATGGT CAGGGATATC TATCCGGACG GCATCGGCTC CAGTCCCTTG TCGCTCACGG TCATGAACAA CGTCCTCTAC TTTAGTGCCG CCGGTGACGA AGGTGGCTAC GGACTCTGGA AGAGCGACGG TACCGCCGAG GGGACCACGT TCGTAAAGGA CACCTCTCCC TTCAATCACT CGCTTCTCCC TGCCTACCTG ACCCCCGTAA ATGGAACTCT CTTCTTCGCC GCCCACGACG AAAACGCCGG ATTCGAGCTC TGGAAGAGCG ACGGGACCAC CGACGGCACC GTGCTGGTGG CCGATATCCT GCCGGGGGAA GGGGCGTCCA ATCTGCGCTT GCTTACCGGC GTGAACGGTA CCCTGTTTTT CGTGGCCGAC GATGGAGTGC ACGGCGAGGA GCTCTGGAAA AGCGACGGGA CGCCCGAGGG AACGGTGATG GTGAAGGACA TTTTTCCGGG GGATGGGATA TCTGGCATCA CCTGGATCAA GGTGATGAAC GGGATGCTCT ATTTTGCGGC CGACGACGGA GTGAACGGCC TCGAACTCTG GCAGAGCGAC GGAACCGCCG AGGGGACGGT GCTGGTCACG AACATCGTGG CGGGCCAGGG GAGTTCGTCT CCCTCGTATC CGGTGGTGGC GGGCAATACA CTCTACTTTG CCGCTACTGA CGGAGGCAGC GGCGTCGAGC TCTGGAAGTT TTCGCCCGAT CCGCCCGATG GTGATCTGAC CGGCAACGAG ACACTGGAGA TCCCCGATGT GCTGCGCGCT CTGCGGATTG CGGCCGGCAT CGCGGCTCCC ACCGTGGCCG ACTTCATCCA CGGCGATGTG GCCCCCCTTG ACGGAAACGG CCGCCCCGCC CCGGACGGCG TGATAGATAT GAACGACGTG CTGGTTGTCT TACGCAAGAT GCTGGGCGTC GTGTCGTGGT GA
|
Protein sequence | MNSKRYRSDT RPAACAIRLL AVLVLLAGLA ARAEAGPALP VMVKDINTAS VPVSSSPSGM TANGGILFFS AGDGANGPEL WKSDGSAEGT VLVKDINAGP GPGTPQNFSV MNGITYFSAM DSFRGVELWK SDGTAAGTVI VKDIYQGGES SNPLELTVAG NTLFFSADHP VYGKELWKSD GTAEGTVLVA DIAAEASSTP QWLRAVNGTL FFAADDGLHG RELWKSDGTP EGTVMVKDIN PFGGSDPGEM AVSGGILYFT ADDGENGHEL WRSDGTAEGT YLVADIAPGE ESSYPFEPVG INGLLYFTAN DGYTGYELWQ SDGTPEGTTL VKDINPDGED SMPWGIVGMD RYVYFAADDG VNGYELWRTD GTMGGTEMVA DIQPGMGGSM YSSPRLVNGM LLFAADDGEH GIEIWKSDGT AEGTLMVRDI IPDAMSWPSE LMVHNGTLYF AADDGVNGTE LWKSDGTAEG TVLVRNIAPE TASSLPYQLA VMGTTVFFAA ADTDLDFDVW KSDGTADGTV LVKEINPEGW AYLDRLMVVG DTLYFLAEDN YGEASHGIEL WKSDGTAEGT RMIKDINPGP QGIFFPGNPN YPFSMAAVGT TVYFPGFTAG NGHELWKSDG TAEGTVLVKD INPVFDFSSF PDSFTAMNGA VYFVADDGTH GAELWKSDGT ADGTRMVRDI YPDGIGSSPL SLTVMNNVLY FSAAGDEGGY GLWKSDGTAE GTTFVKDTSP FNHSLLPAYL TPVNGTLFFA AHDENAGFEL WKSDGTTDGT VLVADILPGE GASNLRLLTG VNGTLFFVAD DGVHGEELWK SDGTPEGTVM VKDIFPGDGI SGITWIKVMN GMLYFAADDG VNGLELWQSD GTAEGTVLVT NIVAGQGSSS PSYPVVAGNT LYFAATDGGS GVELWKFSPD PPDGDLTGNE TLEIPDVLRA LRIAAGIAAP TVADFIHGDV APLDGNGRPA PDGVIDMNDV LVVLRKMLGV VSW
|
| |