Gene Information |
Locus tag | GSU3219 |
Symbol | |
ID | 2688321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3525819 |
End bp | 3531911 |
Gene Length | 6093 bp |
Protein Length | 2030 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637127912 |
Product | fibronectin type III domain-containing protein |
Protein accession | NP_954260 |
Protein GI | 39998309 |
COG category | [S] Function unknown |
COG ID | [COG1572] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
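The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; neither convention is stated in the record, although the listed gene sequence does end in TAA. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for GSU3219.
length_bp = gene_length(3525819, 3531911)
print(length_bp)                  # 6093, matching "Gene Length"
print(protein_length(length_bp))  # 2030, matching "Protein Length"
```

Under these assumptions the reported Gene Length (6093 bp) and Protein Length (2030 aa) are consistent with the reported coordinates.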
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.341316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGA GGCGATGTAT CGCGTCGCTG GCGACCGTGG GGATGCTCGC CATGCTGTTG GTCCAGGCGG AGCCCCGACA GGTCACGGGC AGCGGCTATA GCGCGGAGAC CGGCAAGAGA CAGCGGCCGG CCCCCAAAGA ATCCGGGTCG GCAGACAAGG CAATACGAAA GCGATGGCTC GTCCAGTTCA ACGGTCCGGT CCGGCCTGAG CAGCGACGGC AGTTGGAGGC GCTCGGCTGC CGTATCGGCG ATTACATGCC CACGAATGCG TTTGTTGCGC TCATGGACGA CAAGGCGGCC AAAAGAGTTG CCCTTCTCTC ATTCGTCGAG GACATCACCC GATTTGCACC GGCCGACAAA CTGGTGGGCA CGGCGCGGAA GGACCTGACG GCGGCACCGA CGTCCGAGAT CCGGATCCGG AAGGTGCTCC GGGTGGACGA CCCCGCCGAC CGGGCAGCGG TCATTGCCGC CACCCTGCGG GGAAATGGCC GGATACTGAA CGTGGGTGCA CGCACCATTA CCGTGGAGGT ACCGGAGGAG CTGCTCGCCC CCCTTGCCCA GCAGGAGGAG ACAGCCTGGA TCGGCGAGGT CGGGGAGTTG CGGCTGCACA ACAGCGATGC TGCCTGGGTG GTACAGACCA ATGAGGTCGA TAATCGTACG ATCTGGGAGA AGGGAATTAC CGGCGCGGGG CAGATCGTCG GTATCGCCGA CTCGGGGGTC GACTACGACA TGCCCTGGTT CGCCGATCCG AACGGCGCTC TTCCCGGGCC GGGACACCGC AAGATCGTGG GGTACGACGC CACTCTGGGA GACAACCACG ATGTGGCCGA CGGCCACGGC ACCCACATCG CCGGGACCAT CTGCGGCGAC CGGGGGCCCG GCATGCCCGG CAACGGCATT GCACCCGGCG CGCGCATCCA CGTCCAGGAC CTGGTCGGCA CCGACGGCAC GCTGACCGGC AGCCTGGAAC TGGAAACCGT GCTGAAAAAA GCCTATGACA GCGGAGCCCG GATTTTCAAC GGTAGTTGGG GGGTCGATAG CGGGAACTAC GACGCCCTCG CCGCGGCCCT TGACGACTTT TCCTGGCGGC ACAAAGATTT CCTGGCCGTG TTCGCCAACG GCAACGGCGG CCCGGCGGAG CAGACGGCAA CCAGTCCGGC CATAGCCAAG AACGCCACCA GCGTAGTGGC AACCGGCAAC GGAACCGATG CCGCCACGGT CAGCGCCGAA AGCAGTGTCG GCCAGGCTCC GGATGGCCGT GCCAATCCGT CGGTGGGCGC TCCGGGCCAG GGTGTGGTCT CGGCTCGCTC CGACGGCCTT CTCGGCAGCG GCAACAGCGG CACCATGGCC ATGTCGGGCA CCTCGGTGGC AGCGGCGGTC ACCTCGGGAG CAGCCGCGCT CATCCGCCAG TACTTTACCG ACGGCTTCTT CCCCACCGGC AGCCCTGTCG CCACCAACAA GCTGCAGCCC TCGGCAGCAC TGCTGAAGGC AGTCCTCGTG AACAGCGCCG AAGCGCTCCT CTCGGACGAC CCCGGCGACT CGTGCCCCTC GAAGAGGGGC GGCTGGGGGC GCCCCAAACT GATCAACACC CTCTTTTTCA ACGGTGACAG CCACTCCCTG GAAGTAGTCG ACGGGGGTAC AGGGCTGGAA ACCGACGGCG TCTGGCAGCG GCTCTACTTC TCCCCCGGCG GCAGAAGGCT CAAGATCACA CTGGCCTGGA CCGACGCCCC GGCGGCACCC GGTGCCACGT CCCCTCTCAC CAATGACCTG AATCTGGTGG TTGTCGCACC GGACGGAACG 
ACCTACCTGG GCAATGATCT GAACTGTTCC CACGGCGACT ATGAATCACG GACCGGCGGT TTCTCCGACA GGGTCAACGT TGAAGAGCAG GTGGTCATCA AGCGGCCCGT GGCCGGCACC TACCTGGTCA AGGTGATCGG CGCCAGCATT CCGGTGGGTC CCCAGCCCTT CGCTCTGGTA ATGACCGGGG TCACCGGGGT CACCTCGGAC GGCCGGATCG CCCTGACCAA CAGCACTAAC GGCACCCTGG AGGCGCCGGG ACAGGTGTCG GTCATGGTGA CCGACCGGGA TATCAACCGC GATGCCTCAG CGATCGAGAC CATGACGGTG GACCTGCTCG GCGAGACCGA GAGCAACCCG GAGCAGGTGG TCCTCACCGA AACCGGCCCC AATACCGGGA CCTTCACCGG CACCTGCCGG ATCGCACTCG GCGGCACCGC CATCCATGAC AACGGTGCGC TGGAAGCGCG CCATGGCGAG ACGATCAGCG CCCGCTATAC CGATGAAATC AACCTGAGCG GCTATCCCCG CCTCGTGAGC GTGTCGGCCC GGATTGTCGA CAGCGTGCCG CCGACGATCT CGGCGGTTGG CGTCGGAGCC CAGCTCTCCG AAACCTCGGC GACCGTAGCC TGGACCACCG ACGAACCCGC AGACTCGAAG GTCAGCTACG GCTCTGACGG CTCCCTGGGT CTCAGTGTCA TCGACGGAGC CTTTGTCTCG AACCATCTCC TGGCACTCTC AGGTCTGTCG GAAGGACAGG ACTATTTCTT TTCCGTCACC TCATCCGACG CCGCCGGCAA CGTCTCCACG GATACCAATG GCGGCAACAA CTATACCTTC CGCACCGCCA GCCTCCCGCC GTCCCTGGAG GTCTTTTGCT CGGCGGAAAA CAACGAGACC TATCTCCCGA CGGTCAGGGT CTTCGGCACC GCCACCGATC CGGCCGGAAT CGACCGGGTG ACGGTCAACG GCCAACCAGC CGTCTGGCGC GCCACCGATG GCTACTACGA GGCCACCGCC TCCCTGGTGC CCGGCAGCAA CACCATTACC GTAATCGCGA CGGATACGCT CAACAACCCG GCCGGGCAAA CCCTGACGGT CAACAGGACC CTGCCCCCCT TTGATCTGCT CGTGACGTCG GTCGTCTCTA CCACGAACCT CCTGCCAAGC TCGGCCATCA GGGTGGACGG GACGGTGCGG AACGAAGGCA CCGCGGATGC CCCGTCGGCG GAGGTTGCCT TTTACCTCTC CCGCGGCGGG GCTGCCGCCG GCATCCCCCT CGGCAGTATC CCGATCTCAC CCATCCCGGC GGGCCAAAGC GGCGCAGTTT CCTTCACCGC CAGCTTGCCT CCGGAAGTCG TACCAGGAGT TTACTTCATC GTCGCAACCG TCGACCCCGC CGACCAGGTG GCCGAAGCCC GGGAGGACAA CAACTCCCTG ACCGGCAATC CGGTTACGGT GGGCCGCCCC GATCTGGTAC CGCTGACGGT GAGCAACACA ACCCTCATGT CGCCGGGAGG CACCATTTCG ACGAGCCTGT CGGTGCGCAA CGACGGAGCC GCCAGCGCAC CCGTTTCAAC AGTCGCTGCC TGGCTTTCGG TCGACACGAA CCTCTCCGGC GACGATATCC TGATCGGTAC CGCCCCCGCC GCCGCTCTGT CTCCCGGCGC GTCAACCACG GTCGGCATCA GCGGCGTCCT CCCGCCAGGC ATCCAGTCGG GCACCCTGAA CATTATCGCG GTCGTCGACG CCTTGGGTGA GGTGGCAGAG GCGGACGAAA ACAATAATCG GGCAACCGGG CAGCCTCTTA 
CCATCGGCAC GGCAGAACTA TCCGTGACTA CGGTAACGAT GCCGGCCAGC ATTGTCCGGG GCTCGACCGC CTCCGCCACC GCAACCGTCG CCAATACGGG TCACTATGCA GCAACCGGCG TGCGCGTCGG CGTCTATCTC TCGTCCGACA CCGCCATCAC CACCTCCGAC ATGTTTTTGG GGAGCGGCGT CATTGCATCC CTGGAACCCG GGGCATCGGC GCCGGTGAGC ATACCAGTTC CCATAACGGA CATCGTTGCC GCGGGGACAT GGTATGTGGG CGCCGTGGTC GACGATCTCG GGATGATCGC AGAATCCGAC GAGGGCAATA ACGCCCTGGC CGGCAATCAG GTCGAGATCC TGGCGGACGG CCTTGACCTG ACCGTTCAGG GTGTGACCGC GCCAGCCTCC GGCACCACCG GTCAACCGGT CACCATTACG GCAACGGTTG CCGCCACCAT GCCGGCGGCG GCATCCGCGG TCCAATTCTT CGTCTCGCGT GATCCGGTCA TAACCAGCGC CGATACGTAT CTGGCCACCA AGGCGGTCGG CTCCTTCGGC GCCGCAGGCG CACAGACCGT CACTGCCACC GTGACGCTTC CCACGACGCT GACCAGCGAC ACCTGGTATC TCGGTGCCAT TGCCGACGCC TATGGAGTGA TTACCGAAAT CAATGAAACC AACAACGCCT CTGCCGGCCG TGCCATTACG GTTAACGGTC CCGAGCTTGT CGTGGAATCC CTTACTTCCG CCTCCGACAC GGCTTACACG GCGGGCACCG TCAGCCTGGC CAGCACCATC AGGAGCATGG CCGGCGCCGC ACCCACCCAC CGGGTAGAGT TCTATCTCTC CACCGACCCG GCAATCACCA CCTCCGACAT TTACCTCGGC TACCGCACCG CGTCCCTCCC CGCGGGGGGA AGCAGCACGG CGACCACAAT CCTGACGATC CCCCGCTACC TGACCGGGGG AGACTATTAC ATCGGTGCCA TCGCAGACCC CGGCAACGTC ATTGCAGAGG CGAACGAGAA CGACAACTCC CTGGGCATCC CCCTGCACAT CATCGGCCCC GATCTCCAGG TAGACGGCCT GAGCCTCCCC GGCAGCGCCC TGTCAGGGGT CCCCCTCACC ATTTCCAGCC GGGCATTTTC GACCCAGGGC GGATCCGGAA GTTTTACGGT CGACTTCTAC CTTTCCTCAG ACCAGACCAT TACTACCGGC GACGTGTACC TTGGCCGCCG CACGGTGAGC AGCCTGGCTG TGGCCGGAGC GAGCACCGCA ACGGCCACGG TGACGATTCC GAATTACGTC TTCACGGGCC GCTATTACGT GGGCGCCATC GTGGACCCCT ACAATTACGT GAAAGAGGAA ACGGAGACCA ACAACTCGAC CGGGACAGAC CAGGCAGTCG CCGTGGAGGT AACGGGTGCC GAACTGGCAC TAGCGTCTCT CTCGGCACCG GCCTCGGCCA AGCCCGGCGA AACCATCGCC GTGGTCAACG CCCTTGCCAC TACGGCAGGC TCGGCGCCCT CATCGTACAT GGAGTTCTAC CTGTCGACCG ACAGCATCAT CACCGCGGCC GACCGCTATC TCGGAGGCCG CACCGTCTCA GCCCTGCCCG CGGGGGGGGC CAACAACGCC GCAACCGGCC TGAAGGTTCC AGCGGACATC CTGCCCGGGA CCTACTACCT CGGCGCCGTG AGCGACCCCT ACAACACGGT CAGGGAGGCC AACGAGGCTG ACAACACCCG AACCGTGCAG CTCACCGTCA CCGGAAGGGA TCTGACGGTC GAGGCACTGA GCGGGCCGGC 
CGCCGCCCTC GCGGGGGCGA CCATCGGAGT CGCCAATGCC GTGAAGAGCG CCGGCGGCGC AGTGCCGGGC TTCGATGTCA CCTTCTACCT GTCGCGCGAT GCCGTCATAA CCCGCTCAGA TGCCTACCTG GGGACCCGCT TCGTATCCGG CCTGGACATC GATGGCGCGA ACACGGTGAC TACCACCCTC AAGCTCCCCA ATGATCTGGA GGGCGGACGC TACTACCTGG GTGCCATCGT CGACGGTGGC AACCTGATCC CCGAGACCGA CGAGAGCAAC AACGCATCGG CTGCGGCCCC CATCGACCTG GTTGGCGCCG ACCTCGCGGT CTCGGCGCTG ACCGCGCCGG CTACGGCCAG TGCCGGCGAA ACGATCTCCG CCCAGGTGAT CGTCACAACC CGTGCCGGCG GCTCTCCTTA CTCGCTCGTT AACTATTACC TGTCAACCGA CGAGACTGTC TCGCCAGACG ACATCTATCT GGGAGTATCC ACCATACCTT CACTGGGCCC GGGCGGCGGC GCCACTGTCG GCAAGTCGGT GAAACTGCCC GCGGACATGG AGCCGGCAAC CTACTACCTG ATTGCCGTTG CAGACCCTGC CAATGCAGTG GCAGAGGCGG ATGAAACCAA CAATACCTCT CAACCGCGCG CCATTGCGGT GACTGTACCC TAA
|
Protein sequence | MKLRRCIASL ATVGMLAMLL VQAEPRQVTG SGYSAETGKR QRPAPKESGS ADKAIRKRWL VQFNGPVRPE QRRQLEALGC RIGDYMPTNA FVALMDDKAA KRVALLSFVE DITRFAPADK LVGTARKDLT AAPTSEIRIR KVLRVDDPAD RAAVIAATLR GNGRILNVGA RTITVEVPEE LLAPLAQQEE TAWIGEVGEL RLHNSDAAWV VQTNEVDNRT IWEKGITGAG QIVGIADSGV DYDMPWFADP NGALPGPGHR KIVGYDATLG DNHDVADGHG THIAGTICGD RGPGMPGNGI APGARIHVQD LVGTDGTLTG SLELETVLKK AYDSGARIFN GSWGVDSGNY DALAAALDDF SWRHKDFLAV FANGNGGPAE QTATSPAIAK NATSVVATGN GTDAATVSAE SSVGQAPDGR ANPSVGAPGQ GVVSARSDGL LGSGNSGTMA MSGTSVAAAV TSGAAALIRQ YFTDGFFPTG SPVATNKLQP SAALLKAVLV NSAEALLSDD PGDSCPSKRG GWGRPKLINT LFFNGDSHSL EVVDGGTGLE TDGVWQRLYF SPGGRRLKIT LAWTDAPAAP GATSPLTNDL NLVVVAPDGT TYLGNDLNCS HGDYESRTGG FSDRVNVEEQ VVIKRPVAGT YLVKVIGASI PVGPQPFALV MTGVTGVTSD GRIALTNSTN GTLEAPGQVS VMVTDRDINR DASAIETMTV DLLGETESNP EQVVLTETGP NTGTFTGTCR IALGGTAIHD NGALEARHGE TISARYTDEI NLSGYPRLVS VSARIVDSVP PTISAVGVGA QLSETSATVA WTTDEPADSK VSYGSDGSLG LSVIDGAFVS NHLLALSGLS EGQDYFFSVT SSDAAGNVST DTNGGNNYTF RTASLPPSLE VFCSAENNET YLPTVRVFGT ATDPAGIDRV TVNGQPAVWR ATDGYYEATA SLVPGSNTIT VIATDTLNNP AGQTLTVNRT LPPFDLLVTS VVSTTNLLPS SAIRVDGTVR NEGTADAPSA EVAFYLSRGG AAAGIPLGSI PISPIPAGQS GAVSFTASLP PEVVPGVYFI VATVDPADQV AEAREDNNSL TGNPVTVGRP DLVPLTVSNT TLMSPGGTIS TSLSVRNDGA ASAPVSTVAA WLSVDTNLSG DDILIGTAPA AALSPGASTT VGISGVLPPG IQSGTLNIIA VVDALGEVAE ADENNNRATG QPLTIGTAEL SVTTVTMPAS IVRGSTASAT ATVANTGHYA ATGVRVGVYL SSDTAITTSD MFLGSGVIAS LEPGASAPVS IPVPITDIVA AGTWYVGAVV DDLGMIAESD EGNNALAGNQ VEILADGLDL TVQGVTAPAS GTTGQPVTIT ATVAATMPAA ASAVQFFVSR DPVITSADTY LATKAVGSFG AAGAQTVTAT VTLPTTLTSD TWYLGAIADA YGVITEINET NNASAGRAIT VNGPELVVES LTSASDTAYT AGTVSLASTI RSMAGAAPTH RVEFYLSTDP AITTSDIYLG YRTASLPAGG SSTATTILTI PRYLTGGDYY IGAIADPGNV IAEANENDNS LGIPLHIIGP DLQVDGLSLP GSALSGVPLT ISSRAFSTQG GSGSFTVDFY LSSDQTITTG DVYLGRRTVS SLAVAGASTA TATVTIPNYV FTGRYYVGAI VDPYNYVKEE TETNNSTGTD QAVAVEVTGA ELALASLSAP ASAKPGETIA VVNALATTAG SAPSSYMEFY LSTDSIITAA DRYLGGRTVS ALPAGGANNA ATGLKVPADI LPGTYYLGAV SDPYNTVREA NEADNTRTVQ LTVTGRDLTV 
EALSGPAAAL AGATIGVANA VKSAGGAVPG FDVTFYLSRD AVITRSDAYL GTRFVSGLDI DGANTVTTTL KLPNDLEGGR YYLGAIVDGG NLIPETDESN NASAAAPIDL VGADLAVSAL TAPATASAGE TISAQVIVTT RAGGSPYSLV NYYLSTDETV SPDDIYLGVS TIPSLGPGGG ATVGKSVKLP ADMEPATYYL IAVADPANAV AEADETNNTS QPRAIAVTVP
|
| |
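The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two sequence fields relate, the code below strips that spacing, computes the GC fraction, and translates codons using the amino-acid assignments of translation table 11 (which match the standard code; table 11 differs only in the permitted start codons). Only the first three blocks of the gene sequence are used as sample input, so the GC value printed is for that fragment and not for the whole gene. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI ordering (first base varies slowest); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    The first codon here is ATG, so no special handling of the
    alternative start codons allowed by table 11 is attempted.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = "ATGAAACTGA GGCGATGTAT CGCGTCGCTG"
print(translate(fragment))              # MKLRRCIASL
print(round(gc_fraction(fragment), 3))  # 0.533 (fragment only)
```

The translated fragment matches the first block of the listed protein sequence (MKLRRCIASL).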