Gene Information |
Locus tag | GSU0286 |
Symbol | |
ID | 2686797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 315412 |
End bp | 317049 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637124952 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | NP_951346 |
Protein GI | 39995395 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
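The coordinate and length fields above are tied together by simple arithmetic. The sketch below checks that the values in this record agree with each other. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of this unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp 315412, End bp 317049.
length = gene_length(315412, 317049)
print(length, protein_length(length))
```

With the record's coordinates this reproduces the listed Gene Length (1638 bp) and Protein Length (545 aa).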
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0933039 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGAGCAA ACGGAGGGAA CGCAACGGGC AGGCCTGCGG CAGGAGGCGG GCAGTACGGG ATCGTGCTGG CTGAACTCTA TCGGGCGCTG AAGGCCCTGA CGTTTTATCC TGAGGGGCAT CCTCAGCGCG CCGAAGTCCT GGCACGGGCC CATGCCGCGC TCCGGGGAAT CCTCTGGGGA ACAGAGCTCG TGTTCGTCAT CGGTAAGAAC GGTTTCACCG CCTCGGAGGG CGGGGCTTTT GTGGAGCCGA CAGCAATGAC CCAGGCCCTT GCGCGGGAGC TGTTCATCCG ACGGGTAAAG CGGCTCACGA TCCTGCCCGA CGTCGGCGAG GCCGACCTGT CCTTGTTTCT CACCCTTTTA TCCATGGACC ATCGCGACGT CCACGAGGCG GGAGGGATAG AGGCGCTTAT GGCCCAACTC GGCCTGACGA CCATCTGGGT CAACGAAGTG GACCTTGACG AAATCCGGCG CAAGCGGGCG GTCGTCGAGC AAACCCGTTC TGCTGCGGCC CATGACGGCA CCTCTGACAA TGTATTGTCT GCCATCGAAA AGGTGGCGGA CCAGGACGGG ACACCCGAAG AGGGCAGGGA CGCGGCAGGA CTGCAGGAAG AGGATCGGTT GGAGGCGGAA GCCATACTCG GCCGTATGGA GCGGGAAACC AGCGACGATC GGTACCGGGA GCTCGCCCGC CTGTTGTCGG CCCGTTGCGC GGAACTCGGC GACCGGGGTG AGTTCGAGCG GATTCTGTGG GTGTTGGTGA ACCTGCACCG GCATGCGAAG AGTGAAGCGG CGAGCGCCGC GCGGCGGGGG TATGCCCTTC TTGCCTTCGA GAAGGCGGCC GGGGGGCCCA TGCTCCCGTT TCTCGTAGGG CGCCTGGAGG AGCGCGGGGA GGAGCGGGAG ACCCTCGTCG ACCTGTTCCG GGAGATTGGC GAACCAGCGG TTGCGCTGCT GGTTGAACGC CTCACGCTGG CAGAGAGCAG GAATGCCCGG AAGCTGATCA TCGATGCACT CGTGGCAATC GGTGCGGAGT CGGTGGCTCT TCTGACCGAG CATCTTGTCG ACCGTCGCTG GTACGTGGTG CGGAACGCGG CGGTCATCCT CGGCGAGATC GGCGATCCTG CAAGCGCCGA ACCCCTGCGG GCCTGTTTAG TTCATACGGA CGTAAGGGTA CGCCGGGAGG CGCTCAGGAG CCTCGTCAAG ATCGGCGGTG AACAGGCAGA GGACGCGATT ATCGGCCTGC TCGCAGCGGA GGATAATCTG ACGAAGCGTC ATGCCGTGAT CTCGCTTGGT CTCCTGAAGA GCCGCCGCGC GGTGGAGCCT CTTTGTGCTC TCATCGAACG CCGCGATCCC TTTAAAAAAT CGCTTGCCTT CAAGAGGGAC GTCATCCAGT CCCTCGGCCG GATCGGAGAC CGCCGGGCGG TGCCGTCTCT GCTCCGGCTG CTGGAGCATC GGCCCTGGTT CGGCCGACGC CGTTGGGACG ACGTCCGGAT GGCGGTGGTA ACCACCTTGG CCCAGATCGG TGACCCGGCC GCTCTGGGAC TGCTGGATTC GCTGGCAGCG AAAGGCGGCA TGCTTGCGGC GGCCAGTGCC GATGCAGCAG AAGACATCCG CCGTCGAGGG GGGGTCGTTA ATGATTAA
Protein sequence | MGANGGNATG RPAAGGGQYG IVLAELYRAL KALTFYPEGH PQRAEVLARA HAALRGILWG TELVFVIGKN GFTASEGGAF VEPTAMTQAL ARELFIRRVK RLTILPDVGE ADLSLFLTLL SMDHRDVHEA GGIEALMAQL GLTTIWVNEV DLDEIRRKRA VVEQTRSAAA HDGTSDNVLS AIEKVADQDG TPEEGRDAAG LQEEDRLEAE AILGRMERET SDDRYRELAR LLSARCAELG DRGEFERILW VLVNLHRHAK SEAASAARRG YALLAFEKAA GGPMLPFLVG RLEERGEERE TLVDLFREIG EPAVALLVER LTLAESRNAR KLIIDALVAI GAESVALLTE HLVDRRWYVV RNAAVILGEI GDPASAEPLR ACLVHTDVRV RREALRSLVK IGGEQAEDAI IGLLAAEDNL TKRHAVISLG LLKSRRAVEP LCALIERRDP FKKSLAFKRD VIQSLGRIGD RRAVPSLLRL LEHRPWFGRR RWDDVRMAVV TTLAQIGDPA ALGLLDSLAA KGGMLAAASA DAAEDIRRRG GVVND
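The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any length or composition check. A minimal sketch of that cleanup, plus a GC-fraction helper, is shown below using only the first two blocks of the gene sequence. The helper names are hypothetical, and the record does not say how its GC content value is computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGGGAGCAA ACGGAGGGAA")
print(len(first_blocks), gc_fraction(first_blocks))
```

Applying `clean_sequence` to the full Gene sequence field and comparing `len(...)` with the listed Gene Length is a quick way to confirm that nothing was lost when the record was copied.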