Gene Information |
Locus tag | GSU1581 |
Symbol | |
ID | 2685520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1732220 |
End bp | 1734862 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637126261 |
Product | polyA polymerase family protein |
Protein accession | NP_952632 |
Protein GI | 39996681 |
COG category | [J] Translation, ribosomal structure and biogenesis; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase; [COG0618] Exopolyphosphatase-related proteins; [COG3448] CBS-domain-containing membrane protein |
TIGRFAM ID | |
|
|
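The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of this record.

```python
# Values copied from the Gene Information table above.
START_BP = 1732220
END_BP = 1734862
GENE_LENGTH_BP = 2643
PROTEIN_LENGTH_AA = 880


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2643
    print(expected_protein_length(GENE_LENGTH_BP))  # 880
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed above.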
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.48144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTTA TTACCACCCA CGTGAACGCC GATTTCGATT GTCTCGGCGC CATGGTCGCG GCCAGTAAAC TCTACCCCGA CGCCCTTATG GTCTTCTCCG GTTCCCAGGA AAAGAGCATG AGGGATCTCT TCCTCAAGAC AACCGGCTAC GCACTCCCCT TTACCCGGCT CAGGGATGTG GATTTTTCCG ATATTACGCG ACTGGTTCTC GTAGATTGCC AGCATACCTC GCGTATCGGC CGCTTTGCGG AGGTTGCACG TCGACCCGGC GTCGAGGTCC ATATCTACGA CCACCATCCC GGCTCCAGCG GCGACATCAG GCCGAGCGGC GGAGAGATCC GCGATTGCGG CTCGTCCACG ACTATTCTGA CCCGAAAGCT CATGGAGCAG GGCATCGAGG TGACCGCTGT CGAGGCTACC CTCATGATGC TGGGCATCTA TGAAGACACG GGCAACTTGA CCTTTCCTTC CACCACCCCC GAAGACTATG CCGCAGCATC CTGGCTCCTG GAGCGGGGAG CCAACCTGAA TATCGTGTCG GATTTCGTCT CTCAGGAGCT GACTGCCGAG CAGGTTGCGC TTCTGAACGA TCTGCTGAAG TCCCTGCGCA GTACGCCGGT GAACGGGGTC GACATCGCCG TTGCCCATGC CACTCTCGAC CACTATGTGG GCGACATTGC GGTTCTTGCC CACATGATGC GCGACATGCA GAACCTGGAC GCGATCTTTC TCGTGGTTGG AATGGGGGAG AGGGTCTACC TGGTGGCGCG CAGCCGCATT GCCGAAGTCG ATGCCGGCGC TGTCATGCGC GTCTTCGGCG GGGGAGGGCA TGCCACTGCT GCGGCTGCCA CGGTAAGGGA CCAGACCGTA ATCCAGGTTT TGGGTCGTCT CAACAGACTT CTGCCCGAGC TGGTAAACCC TGTCAGGACA GCCGCTGACC TCATGTCATC GCCGGTGATT ACCCTGCCCC TTGCCACGAC CATCACCGAG GCGCGGGAGA TACTCACCCG TTACAATGTG AATGCCATGC CGGTCATGGA CGGGGAACGG ATGGCGGGGA TCATCTCACG CCGGATCGTG GAAAAGGCGC TCTATCACGG TCTCGGCAAC CTGCCCGTGG ACGAATACAT GCACACGGAG TTTCTGCGGG CCGCTCCCGA CACCCCGATC AACGCTATCC AGGACTATAT CGTCGGCCAG CATCGGCGGC TGGTACCGGT ATTCAGCGGT GAACGGCTCG TGGGGGTCAT AACGCGGACT GATCTCCTGC GGTATATGTA CACCGGCACG CAGCGCAACG CCGAACCGGT CTATGACCTG GGAAGCGAGA ACCTGCCGGT GCGGCGGCGC GAGGTGGTCC ATCTCATGAA CAGGCACCTG CCGCGCCCGA CGGTCGCCAT GCTGCGCGAC CTTGGCAAGG TGGGGGATGA ACTGGAGCTG CCCGTGTACG CAGTCGGTGG TTTTGTGCGG GATCTGCTTC TGGGGGCAGA GAACGACGAC ATCGACGTGT CGGTGGAAGG CGATGGCATC CTGTTCGCCG AGACCGTTGC GAATCGCGTG GGATGCCGGG TGAAAAGTCA CGCCAAATTC GGCACCGCGG TTATTGTCTT TCCTGACGGA CTCAAGGTGG ACGTCGCGAG CACACGGCTC GAATACTACG AAACGCCGGG CGCCCTGCCC ACGGTGGAGC GTTCGTCGCT CAAGATGGAC CTGTACCGGC GTGACTTCAC CATCAACACC CTGGCGGTGA AGCTCAACGC GGAAGGGTTC GGTACCCTGA TCGACTACTT CGGCGCCTAT 
CGCGACCTGC AGGAAAAGAC CATCCGGGTG CTCCACAACC TGTCCTTTGT CGAGGACCCG ACCAGGGTGT TCCGGGCTAT CCGCTTCGAG CAGCGTCTCG GATTCCCGAT CTCGCGACAC ACGGAAAATC TCATCAAGAA CGCGGTGAAA ATGGGATTTT TGGACAAGCT GGGGGGGCGG AGGCTGCTGA ACGAACTGGT GCTGATCCTT CGGGAGCGGG AGCCGGTCAA GGCCATCCTC CGGATGTCTG GGCTGGGGCT TCTCCGGTTC ATTCACCCGG ACCTGGTCCT GGCGCCCAAC ACCCTGCAGG TGCTGGACGA GGTCAAGAAG GTGATCACCT GGTTCGATCT CCTCTACCTG GGCGAGAAGG TCGAGACGTG GGTGGTGTAC TTCCTGGCGC TCACCTCGAG TCTGCCGGAC GAGGGGTTCT GGGGAACCTG CACGCGGCTT TCCGTATCTG AGCACTACCG GGAAAAACTC ATCGATATGA GAGTTCACGG CGAGCAGGTC CTGGAGGTCA TGACCCGCAA GGCGGCCCGT CGGGAGGATG TGCGCCGCAG CGATATCTAC TTCTGGCTCA GGGGGCTCTC TCCCGAAGTG CTGCTCTACA TCATGGCGAA AACCCGGAGT GATGAGGTGA GACGTTACGT TTCCCTGTAC GTGACCCAGT TGCGGGGCAT CGTTACCCAT ATTACCGGCG ATGACCTCAA GACCTTGGGA ATCCCTTCAG GGCCGCGATA CCGGGAGATC CTCGACCGGG TCCTTACCGC CCGCCTCAAC GGCGAAGCGG CAACCCGCGA CGACGAGATG CGCATTGCCG TGCGTCTGGC GGATTCGGCC TGA
|
Protein sequence | MDVITTHVNA DFDCLGAMVA ASKLYPDALM VFSGSQEKSM RDLFLKTTGY ALPFTRLRDV DFSDITRLVL VDCQHTSRIG RFAEVARRPG VEVHIYDHHP GSSGDIRPSG GEIRDCGSST TILTRKLMEQ GIEVTAVEAT LMMLGIYEDT GNLTFPSTTP EDYAAASWLL ERGANLNIVS DFVSQELTAE QVALLNDLLK SLRSTPVNGV DIAVAHATLD HYVGDIAVLA HMMRDMQNLD AIFLVVGMGE RVYLVARSRI AEVDAGAVMR VFGGGGHATA AAATVRDQTV IQVLGRLNRL LPELVNPVRT AADLMSSPVI TLPLATTITE AREILTRYNV NAMPVMDGER MAGIISRRIV EKALYHGLGN LPVDEYMHTE FLRAAPDTPI NAIQDYIVGQ HRRLVPVFSG ERLVGVITRT DLLRYMYTGT QRNAEPVYDL GSENLPVRRR EVVHLMNRHL PRPTVAMLRD LGKVGDELEL PVYAVGGFVR DLLLGAENDD IDVSVEGDGI LFAETVANRV GCRVKSHAKF GTAVIVFPDG LKVDVASTRL EYYETPGALP TVERSSLKMD LYRRDFTINT LAVKLNAEGF GTLIDYFGAY RDLQEKTIRV LHNLSFVEDP TRVFRAIRFE QRLGFPISRH TENLIKNAVK MGFLDKLGGR RLLNELVLIL REREPVKAIL RMSGLGLLRF IHPDLVLAPN TLQVLDEVKK VITWFDLLYL GEKVETWVVY FLALTSSLPD EGFWGTCTRL SVSEHYREKL IDMRVHGEQV LEVMTRKAAR REDVRRSDIY FWLRGLSPEV LLYIMAKTRS DEVRRYVSLY VTQLRGIVTH ITGDDLKTLG IPSGPRYREI LDRVLTARLN GEAATRDDEM RIAVRLADSA
|
| |
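The sequences above are printed in space-separated blocks of ten letters. A minimal sketch of how such a sequence could be cleaned, its GC fraction computed, and the CDS translated is given below. It assumes only the characters A, C, G and T occur, and it uses the standard codon assignments, which NCBI translation table 11 shares with the standard code (table 11 additionally permits alternative start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    head = "ATGGATGTTA TTACCACCCA CGTGAACGCC"
    print(translate(head))  # MDVITTHVNA
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence would give a value to compare with the GC content field.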