Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3233 |
Symbol | |
ID | 2688298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3547120 |
End bp | 3550323 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637127926 |
Product | cytochrome c family protein |
Protein accession | NP_954274 |
Protein GI | 39998323 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0173325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGAACT TAATGGATCA AGACTCGGCA ACCCCGGGTG TGCAGGACAA GAAGTACACT GAAGCCGAAT TCAAGGCACA GGTCGACCTC TCCATGTGGG ACTCGGCCGT TTACTGCGGC AGCTGCCACG TCGGCGGCGG TTTTGTGGAA AAGGACCGGA ATGGTGTCCG CTACTCCCAG CGCCAGCCCG ATGCGGGCGG CCTCTTCGAC GCCTACCTCT TTTACGTGAA CGACACCTAT GATCCCCTGA CCGGCATGCC GACCGAGAAG GTCGCCATGG CCCCCTGGAT CTATCCCCAG TATGCGGATA ACAATCCCGC CAACGGGCCG GTCCTCGCCC CCAACGGTTG GGGCCGCGCC GCCATGAGCC AGCCGGGCAT GCCCATCAGC GACGGGCAGC TCATGATGCC GAACGTCAAG GAAATGGACT GCCTCTACTG TCACTTCGAG GGGTATGACA ACCTGATGCA CTCGGTCATG AACTACTCCG GTGCACTCAA CGTGACGGCC ACCGTGGGCG CGGGACTCAT GGACACCAAT AAGATGAGCC CCACCTACCA GGGCTACAAC GTCTCCCTGG TGGACGTGGA CCAGAACGGC ATCGTCTCGC TCAACTCCAC GGCCCTCAGC CGCATTAAAG CCAATCCGCC GGCCGACAAC TGCCGGAAGT GCCACATGCC CACCTCGCTC ACCGACCTGC CGAGCATGAT GAAGGACTTC CTGTCGTCGG CGCCCATGAT CTACACGGGC AACTTCTCGG CCTCCATGAC CGGCCTCGAG ATGCCCGCCT TCGACTTCAA CGCACCCTTC GGCTTCACAT GGAACTTCAG CCAGGGTCCC TACGGCATCT CGCCGGTGGT CAACGTCACC AACATCTCCA GCTACATGAT CGCCGCCATC GGCTACCCGG CGGCCATGGG CCAGACCATC TACAACAGCA TGCCCGGCTT CGGTGGCACG GTTATCAACC CGATGATCGG CGAAATCGGC GGCGGCAACA AAGCCGGCAC CGGCCCGCTC TACTACGAGA AGCTCGTGTT CGACGAAAGC GGCAACCCGG TTATGATGGG CATGGCTCCT GCCATGGATC AGAACTCCCT CAAGAAGGGC ATCGTTCCCT TCCCGCGGGC CGAATGGTTC AAGCGGGGTG ACCTGTGGGA CAACCGCGAC CAGGAAGTTC ACATCGGCAT GGAGTGCGCC GGCTGCCACA TGGATACCGA CACCCTGAAG GTTGACGGCC CCGGCAAAGA CGGCAAGAGC CTCTGCGATC CGGGCCGCGG CTACGACAGC GCCAGCGGCG TTGAGACCAG CACAGCCCTC GGCATCGACA GCCGCAACAC CATCAAGAAG TGCGCTGACT GCCACGTGAC CGGCAAGAAC AGCGACGGCG TCGCCATCGA TACCTTCAAC GCTCCCAACC CGACCTCGGC CCACGCCATG TTCAACCTCA CCGCTCCCAT GGTCAACGCG GTCCGGATGA AGGCCGACGG CACTGGCGAA GAAACCTTCC TCGGCAGCCA CATGGACATC ATTGACTGCG CCGTCTGCCA CACCTACAAA AAACAGATGG TTGTCCGCGC CCTCGACTCC ACCTCCGGCA ACCGCTACCC GAACATGCTC GGCTTCGACA CCTCCAAAGG GATGCTCGGC ATGTTCAACG AGCCGATGCC CGGCATGACC AATGAAGGCG TCGAGTGGAA ACCGCTCTAC ACCTGGGCCA AGATCGGCGA CGGCGACAAG ATCCTGCCCG ACGGCAGCGC CAACCCCAAC TGGCGCCGCA AGGTCTACCC GATCAACATC ATCACTGCAG TCCTCTGGAA CAACATCGAT CCTTCCGTTG ATGCCAACGG CGACGGCGTC ACCGGCCGTC CCGCAGCCCC GAACCTGACC TACTACGACC CGTGGATCAG CCGCGACATG AAGGCCGGCG TCAATTACGG CCCCAGCGGT TTCGCTCCCG TGCCGGTAGG CTTCGGCAAC GGCGCCTTCC AGAGCGCCTA CAACCCCGAC GGCACCTTCA CCGGCGCCTG GAACTACGTG GGTGTGTACG GCGGCAACGC GGTCTTCTCG ACGCCCGAGG AGATTGAAAA CTACAAGGCC TTCCGCAACA CCATCGCCCC CGCCGTTGAC GGCAAATCGT GGAGCGGGAC CCAGCTGCTG CTTCTGGGCG GCCCCTACAT GATCACCCAC AACGTCCGGA GCACCGCAAA CTTCGTCCTT GGTAAGAGCT GCGGCGACTG CCACGCCGCT GGTAAAGGGT TCTTCGACGG CGGCTTCAAC ATGACCGGTA CCGCCATCAA GGCCAATGCC GGCGCCAGCT TCATGCAGTC CCCCGCAGAA ATCCTCCAGA TCGTTGCCAA GGCAGAGGAT CTGGAAACCG GCGCCGAACT GGCCACCAAG TCGGGCGCTG CCCGCGAGGT CAAGTTCGAG GAGCACGGCG ACTGGAACCC GGCCACCAAG ACCTTCACTC CCAACCCGGC CGGTGAGTAC AAGAAGGCCA TCGATCTGAG CCGTAGCGAG GCCCTCTATC CCGATGAAGG AACCTTCACC GCCGCCGACG GCACTGTCTA TCCGAACCGC GCCGCTTACG TCGCCTACCT GACCGGCATC GGGGCAGCTC CCACGGCAGA CATCGCCACC GTCGGATCCA ACAACACCGC CGGCCTCCCG ACCACGGCCG AAGTAACCGT CACCCAAGGC ACGGTAACCC TGGCAGCTAC TGCAGCCGGC CCGCTCGCAG CGGACAAGTA CTCCTACGCC TGGACCTGCA GTGATTCCAA TGTTACACTT TCCGGGCAGT CCGTAACGCG CGCCTTCAGC GCCACCGGCA CCTACACCCT GACGCTGCGG GTGAAGAACC TCAGCACCAA CGAAGAGAAA GTCGACCAGA TCAAGGTCAA GGTCACCGCT CCGGCTCCGG CCGCGGGCGT CGCCGTGGCC GCCGCCGGCA TCTCCTACAA CAGCCCCTCC GCCGGCTACG CCATCATCCC GCTCACCATC ACCGGCGTGA CCTTCAACAA GGTCAAGGTC GTCTGGGGTG ACGGCAACAC CAACATCTAC ACAACCTCCG ACGCAAACTT CGTCCTGCCC TCCCACAAGT TCTGGGGCTA CCCGGCGAAG AAATCCTTCA CCGCCAAGGT CTATGTCTAC AACGGCACCA CCCTGGCGGC GCAGAACGAT AACATTTCCG TGCTGTTTCC CTAA
|
Protein sequence | MANLMDQDSA TPGVQDKKYT EAEFKAQVDL SMWDSAVYCG SCHVGGGFVE KDRNGVRYSQ RQPDAGGLFD AYLFYVNDTY DPLTGMPTEK VAMAPWIYPQ YADNNPANGP VLAPNGWGRA AMSQPGMPIS DGQLMMPNVK EMDCLYCHFE GYDNLMHSVM NYSGALNVTA TVGAGLMDTN KMSPTYQGYN VSLVDVDQNG IVSLNSTALS RIKANPPADN CRKCHMPTSL TDLPSMMKDF LSSAPMIYTG NFSASMTGLE MPAFDFNAPF GFTWNFSQGP YGISPVVNVT NISSYMIAAI GYPAAMGQTI YNSMPGFGGT VINPMIGEIG GGNKAGTGPL YYEKLVFDES GNPVMMGMAP AMDQNSLKKG IVPFPRAEWF KRGDLWDNRD QEVHIGMECA GCHMDTDTLK VDGPGKDGKS LCDPGRGYDS ASGVETSTAL GIDSRNTIKK CADCHVTGKN SDGVAIDTFN APNPTSAHAM FNLTAPMVNA VRMKADGTGE ETFLGSHMDI IDCAVCHTYK KQMVVRALDS TSGNRYPNML GFDTSKGMLG MFNEPMPGMT NEGVEWKPLY TWAKIGDGDK ILPDGSANPN WRRKVYPINI ITAVLWNNID PSVDANGDGV TGRPAAPNLT YYDPWISRDM KAGVNYGPSG FAPVPVGFGN GAFQSAYNPD GTFTGAWNYV GVYGGNAVFS TPEEIENYKA FRNTIAPAVD GKSWSGTQLL LLGGPYMITH NVRSTANFVL GKSCGDCHAA GKGFFDGGFN MTGTAIKANA GASFMQSPAE ILQIVAKAED LETGAELATK SGAAREVKFE EHGDWNPATK TFTPNPAGEY KKAIDLSRSE ALYPDEGTFT AADGTVYPNR AAYVAYLTGI GAAPTADIAT VGSNNTAGLP TTAEVTVTQG TVTLAATAAG PLAADKYSYA WTCSDSNVTL SGQSVTRAFS ATGTYTLTLR VKNLSTNEEK VDQIKVKVTA PAPAAGVAVA AAGISYNSPS AGYAIIPLTI TGVTFNKVKV VWGDGNTNIY TTSDANFVLP SHKFWGYPAK KSFTAKVYVY NGTTLAAQND NISVLFP
|
| |