Gene GSU3219 details

Gene Information

Locus tag: GSU3219
Symbol:
ID: 2688321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3525819
End bp: 3531911
Gene Length: 6093 bp
Protein Length: 2030 aa
Translation table: 11
GC content: 65%
IMG OID: 637127912
Product: fibronectin type III domain-containing protein
Protein accession: NP_954260
Protein GI: 39998309
COG category: [S] Function unknown
COG ID: [COG1572] Uncharacterized conserved protein
TIGRFAM ID:
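
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based and inclusive.
START_BP = 3525819
END_BP = 3531911


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 6093
print(protein_length(length_bp))  # 2030
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.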


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.341316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGA GGCGATGTAT CGCGTCGCTG GCGACCGTGG GGATGCTCGC CATGCTGTTG 
GTCCAGGCGG AGCCCCGACA GGTCACGGGC AGCGGCTATA GCGCGGAGAC CGGCAAGAGA
CAGCGGCCGG CCCCCAAAGA ATCCGGGTCG GCAGACAAGG CAATACGAAA GCGATGGCTC
GTCCAGTTCA ACGGTCCGGT CCGGCCTGAG CAGCGACGGC AGTTGGAGGC GCTCGGCTGC
CGTATCGGCG ATTACATGCC CACGAATGCG TTTGTTGCGC TCATGGACGA CAAGGCGGCC
AAAAGAGTTG CCCTTCTCTC ATTCGTCGAG GACATCACCC GATTTGCACC GGCCGACAAA
CTGGTGGGCA CGGCGCGGAA GGACCTGACG GCGGCACCGA CGTCCGAGAT CCGGATCCGG
AAGGTGCTCC GGGTGGACGA CCCCGCCGAC CGGGCAGCGG TCATTGCCGC CACCCTGCGG
GGAAATGGCC GGATACTGAA CGTGGGTGCA CGCACCATTA CCGTGGAGGT ACCGGAGGAG
CTGCTCGCCC CCCTTGCCCA GCAGGAGGAG ACAGCCTGGA TCGGCGAGGT CGGGGAGTTG
CGGCTGCACA ACAGCGATGC TGCCTGGGTG GTACAGACCA ATGAGGTCGA TAATCGTACG
ATCTGGGAGA AGGGAATTAC CGGCGCGGGG CAGATCGTCG GTATCGCCGA CTCGGGGGTC
GACTACGACA TGCCCTGGTT CGCCGATCCG AACGGCGCTC TTCCCGGGCC GGGACACCGC
AAGATCGTGG GGTACGACGC CACTCTGGGA GACAACCACG ATGTGGCCGA CGGCCACGGC
ACCCACATCG CCGGGACCAT CTGCGGCGAC CGGGGGCCCG GCATGCCCGG CAACGGCATT
GCACCCGGCG CGCGCATCCA CGTCCAGGAC CTGGTCGGCA CCGACGGCAC GCTGACCGGC
AGCCTGGAAC TGGAAACCGT GCTGAAAAAA GCCTATGACA GCGGAGCCCG GATTTTCAAC
GGTAGTTGGG GGGTCGATAG CGGGAACTAC GACGCCCTCG CCGCGGCCCT TGACGACTTT
TCCTGGCGGC ACAAAGATTT CCTGGCCGTG TTCGCCAACG GCAACGGCGG CCCGGCGGAG
CAGACGGCAA CCAGTCCGGC CATAGCCAAG AACGCCACCA GCGTAGTGGC AACCGGCAAC
GGAACCGATG CCGCCACGGT CAGCGCCGAA AGCAGTGTCG GCCAGGCTCC GGATGGCCGT
GCCAATCCGT CGGTGGGCGC TCCGGGCCAG GGTGTGGTCT CGGCTCGCTC CGACGGCCTT
CTCGGCAGCG GCAACAGCGG CACCATGGCC ATGTCGGGCA CCTCGGTGGC AGCGGCGGTC
ACCTCGGGAG CAGCCGCGCT CATCCGCCAG TACTTTACCG ACGGCTTCTT CCCCACCGGC
AGCCCTGTCG CCACCAACAA GCTGCAGCCC TCGGCAGCAC TGCTGAAGGC AGTCCTCGTG
AACAGCGCCG AAGCGCTCCT CTCGGACGAC CCCGGCGACT CGTGCCCCTC GAAGAGGGGC
GGCTGGGGGC GCCCCAAACT GATCAACACC CTCTTTTTCA ACGGTGACAG CCACTCCCTG
GAAGTAGTCG ACGGGGGTAC AGGGCTGGAA ACCGACGGCG TCTGGCAGCG GCTCTACTTC
TCCCCCGGCG GCAGAAGGCT CAAGATCACA CTGGCCTGGA CCGACGCCCC GGCGGCACCC
GGTGCCACGT CCCCTCTCAC CAATGACCTG AATCTGGTGG TTGTCGCACC GGACGGAACG
ACCTACCTGG GCAATGATCT GAACTGTTCC CACGGCGACT ATGAATCACG GACCGGCGGT
TTCTCCGACA GGGTCAACGT TGAAGAGCAG GTGGTCATCA AGCGGCCCGT GGCCGGCACC
TACCTGGTCA AGGTGATCGG CGCCAGCATT CCGGTGGGTC CCCAGCCCTT CGCTCTGGTA
ATGACCGGGG TCACCGGGGT CACCTCGGAC GGCCGGATCG CCCTGACCAA CAGCACTAAC
GGCACCCTGG AGGCGCCGGG ACAGGTGTCG GTCATGGTGA CCGACCGGGA TATCAACCGC
GATGCCTCAG CGATCGAGAC CATGACGGTG GACCTGCTCG GCGAGACCGA GAGCAACCCG
GAGCAGGTGG TCCTCACCGA AACCGGCCCC AATACCGGGA CCTTCACCGG CACCTGCCGG
ATCGCACTCG GCGGCACCGC CATCCATGAC AACGGTGCGC TGGAAGCGCG CCATGGCGAG
ACGATCAGCG CCCGCTATAC CGATGAAATC AACCTGAGCG GCTATCCCCG CCTCGTGAGC
GTGTCGGCCC GGATTGTCGA CAGCGTGCCG CCGACGATCT CGGCGGTTGG CGTCGGAGCC
CAGCTCTCCG AAACCTCGGC GACCGTAGCC TGGACCACCG ACGAACCCGC AGACTCGAAG
GTCAGCTACG GCTCTGACGG CTCCCTGGGT CTCAGTGTCA TCGACGGAGC CTTTGTCTCG
AACCATCTCC TGGCACTCTC AGGTCTGTCG GAAGGACAGG ACTATTTCTT TTCCGTCACC
TCATCCGACG CCGCCGGCAA CGTCTCCACG GATACCAATG GCGGCAACAA CTATACCTTC
CGCACCGCCA GCCTCCCGCC GTCCCTGGAG GTCTTTTGCT CGGCGGAAAA CAACGAGACC
TATCTCCCGA CGGTCAGGGT CTTCGGCACC GCCACCGATC CGGCCGGAAT CGACCGGGTG
ACGGTCAACG GCCAACCAGC CGTCTGGCGC GCCACCGATG GCTACTACGA GGCCACCGCC
TCCCTGGTGC CCGGCAGCAA CACCATTACC GTAATCGCGA CGGATACGCT CAACAACCCG
GCCGGGCAAA CCCTGACGGT CAACAGGACC CTGCCCCCCT TTGATCTGCT CGTGACGTCG
GTCGTCTCTA CCACGAACCT CCTGCCAAGC TCGGCCATCA GGGTGGACGG GACGGTGCGG
AACGAAGGCA CCGCGGATGC CCCGTCGGCG GAGGTTGCCT TTTACCTCTC CCGCGGCGGG
GCTGCCGCCG GCATCCCCCT CGGCAGTATC CCGATCTCAC CCATCCCGGC GGGCCAAAGC
GGCGCAGTTT CCTTCACCGC CAGCTTGCCT CCGGAAGTCG TACCAGGAGT TTACTTCATC
GTCGCAACCG TCGACCCCGC CGACCAGGTG GCCGAAGCCC GGGAGGACAA CAACTCCCTG
ACCGGCAATC CGGTTACGGT GGGCCGCCCC GATCTGGTAC CGCTGACGGT GAGCAACACA
ACCCTCATGT CGCCGGGAGG CACCATTTCG ACGAGCCTGT CGGTGCGCAA CGACGGAGCC
GCCAGCGCAC CCGTTTCAAC AGTCGCTGCC TGGCTTTCGG TCGACACGAA CCTCTCCGGC
GACGATATCC TGATCGGTAC CGCCCCCGCC GCCGCTCTGT CTCCCGGCGC GTCAACCACG
GTCGGCATCA GCGGCGTCCT CCCGCCAGGC ATCCAGTCGG GCACCCTGAA CATTATCGCG
GTCGTCGACG CCTTGGGTGA GGTGGCAGAG GCGGACGAAA ACAATAATCG GGCAACCGGG
CAGCCTCTTA CCATCGGCAC GGCAGAACTA TCCGTGACTA CGGTAACGAT GCCGGCCAGC
ATTGTCCGGG GCTCGACCGC CTCCGCCACC GCAACCGTCG CCAATACGGG TCACTATGCA
GCAACCGGCG TGCGCGTCGG CGTCTATCTC TCGTCCGACA CCGCCATCAC CACCTCCGAC
ATGTTTTTGG GGAGCGGCGT CATTGCATCC CTGGAACCCG GGGCATCGGC GCCGGTGAGC
ATACCAGTTC CCATAACGGA CATCGTTGCC GCGGGGACAT GGTATGTGGG CGCCGTGGTC
GACGATCTCG GGATGATCGC AGAATCCGAC GAGGGCAATA ACGCCCTGGC CGGCAATCAG
GTCGAGATCC TGGCGGACGG CCTTGACCTG ACCGTTCAGG GTGTGACCGC GCCAGCCTCC
GGCACCACCG GTCAACCGGT CACCATTACG GCAACGGTTG CCGCCACCAT GCCGGCGGCG
GCATCCGCGG TCCAATTCTT CGTCTCGCGT GATCCGGTCA TAACCAGCGC CGATACGTAT
CTGGCCACCA AGGCGGTCGG CTCCTTCGGC GCCGCAGGCG CACAGACCGT CACTGCCACC
GTGACGCTTC CCACGACGCT GACCAGCGAC ACCTGGTATC TCGGTGCCAT TGCCGACGCC
TATGGAGTGA TTACCGAAAT CAATGAAACC AACAACGCCT CTGCCGGCCG TGCCATTACG
GTTAACGGTC CCGAGCTTGT CGTGGAATCC CTTACTTCCG CCTCCGACAC GGCTTACACG
GCGGGCACCG TCAGCCTGGC CAGCACCATC AGGAGCATGG CCGGCGCCGC ACCCACCCAC
CGGGTAGAGT TCTATCTCTC CACCGACCCG GCAATCACCA CCTCCGACAT TTACCTCGGC
TACCGCACCG CGTCCCTCCC CGCGGGGGGA AGCAGCACGG CGACCACAAT CCTGACGATC
CCCCGCTACC TGACCGGGGG AGACTATTAC ATCGGTGCCA TCGCAGACCC CGGCAACGTC
ATTGCAGAGG CGAACGAGAA CGACAACTCC CTGGGCATCC CCCTGCACAT CATCGGCCCC
GATCTCCAGG TAGACGGCCT GAGCCTCCCC GGCAGCGCCC TGTCAGGGGT CCCCCTCACC
ATTTCCAGCC GGGCATTTTC GACCCAGGGC GGATCCGGAA GTTTTACGGT CGACTTCTAC
CTTTCCTCAG ACCAGACCAT TACTACCGGC GACGTGTACC TTGGCCGCCG CACGGTGAGC
AGCCTGGCTG TGGCCGGAGC GAGCACCGCA ACGGCCACGG TGACGATTCC GAATTACGTC
TTCACGGGCC GCTATTACGT GGGCGCCATC GTGGACCCCT ACAATTACGT GAAAGAGGAA
ACGGAGACCA ACAACTCGAC CGGGACAGAC CAGGCAGTCG CCGTGGAGGT AACGGGTGCC
GAACTGGCAC TAGCGTCTCT CTCGGCACCG GCCTCGGCCA AGCCCGGCGA AACCATCGCC
GTGGTCAACG CCCTTGCCAC TACGGCAGGC TCGGCGCCCT CATCGTACAT GGAGTTCTAC
CTGTCGACCG ACAGCATCAT CACCGCGGCC GACCGCTATC TCGGAGGCCG CACCGTCTCA
GCCCTGCCCG CGGGGGGGGC CAACAACGCC GCAACCGGCC TGAAGGTTCC AGCGGACATC
CTGCCCGGGA CCTACTACCT CGGCGCCGTG AGCGACCCCT ACAACACGGT CAGGGAGGCC
AACGAGGCTG ACAACACCCG AACCGTGCAG CTCACCGTCA CCGGAAGGGA TCTGACGGTC
GAGGCACTGA GCGGGCCGGC CGCCGCCCTC GCGGGGGCGA CCATCGGAGT CGCCAATGCC
GTGAAGAGCG CCGGCGGCGC AGTGCCGGGC TTCGATGTCA CCTTCTACCT GTCGCGCGAT
GCCGTCATAA CCCGCTCAGA TGCCTACCTG GGGACCCGCT TCGTATCCGG CCTGGACATC
GATGGCGCGA ACACGGTGAC TACCACCCTC AAGCTCCCCA ATGATCTGGA GGGCGGACGC
TACTACCTGG GTGCCATCGT CGACGGTGGC AACCTGATCC CCGAGACCGA CGAGAGCAAC
AACGCATCGG CTGCGGCCCC CATCGACCTG GTTGGCGCCG ACCTCGCGGT CTCGGCGCTG
ACCGCGCCGG CTACGGCCAG TGCCGGCGAA ACGATCTCCG CCCAGGTGAT CGTCACAACC
CGTGCCGGCG GCTCTCCTTA CTCGCTCGTT AACTATTACC TGTCAACCGA CGAGACTGTC
TCGCCAGACG ACATCTATCT GGGAGTATCC ACCATACCTT CACTGGGCCC GGGCGGCGGC
GCCACTGTCG GCAAGTCGGT GAAACTGCCC GCGGACATGG AGCCGGCAAC CTACTACCTG
ATTGCCGTTG CAGACCCTGC CAATGCAGTG GCAGAGGCGG ATGAAACCAA CAATACCTCT
CAACCGCGCG CCATTGCGGT GACTGTACCC TAA
 
Protein sequence
MKLRRCIASL ATVGMLAMLL VQAEPRQVTG SGYSAETGKR QRPAPKESGS ADKAIRKRWL 
VQFNGPVRPE QRRQLEALGC RIGDYMPTNA FVALMDDKAA KRVALLSFVE DITRFAPADK
LVGTARKDLT AAPTSEIRIR KVLRVDDPAD RAAVIAATLR GNGRILNVGA RTITVEVPEE
LLAPLAQQEE TAWIGEVGEL RLHNSDAAWV VQTNEVDNRT IWEKGITGAG QIVGIADSGV
DYDMPWFADP NGALPGPGHR KIVGYDATLG DNHDVADGHG THIAGTICGD RGPGMPGNGI
APGARIHVQD LVGTDGTLTG SLELETVLKK AYDSGARIFN GSWGVDSGNY DALAAALDDF
SWRHKDFLAV FANGNGGPAE QTATSPAIAK NATSVVATGN GTDAATVSAE SSVGQAPDGR
ANPSVGAPGQ GVVSARSDGL LGSGNSGTMA MSGTSVAAAV TSGAAALIRQ YFTDGFFPTG
SPVATNKLQP SAALLKAVLV NSAEALLSDD PGDSCPSKRG GWGRPKLINT LFFNGDSHSL
EVVDGGTGLE TDGVWQRLYF SPGGRRLKIT LAWTDAPAAP GATSPLTNDL NLVVVAPDGT
TYLGNDLNCS HGDYESRTGG FSDRVNVEEQ VVIKRPVAGT YLVKVIGASI PVGPQPFALV
MTGVTGVTSD GRIALTNSTN GTLEAPGQVS VMVTDRDINR DASAIETMTV DLLGETESNP
EQVVLTETGP NTGTFTGTCR IALGGTAIHD NGALEARHGE TISARYTDEI NLSGYPRLVS
VSARIVDSVP PTISAVGVGA QLSETSATVA WTTDEPADSK VSYGSDGSLG LSVIDGAFVS
NHLLALSGLS EGQDYFFSVT SSDAAGNVST DTNGGNNYTF RTASLPPSLE VFCSAENNET
YLPTVRVFGT ATDPAGIDRV TVNGQPAVWR ATDGYYEATA SLVPGSNTIT VIATDTLNNP
AGQTLTVNRT LPPFDLLVTS VVSTTNLLPS SAIRVDGTVR NEGTADAPSA EVAFYLSRGG
AAAGIPLGSI PISPIPAGQS GAVSFTASLP PEVVPGVYFI VATVDPADQV AEAREDNNSL
TGNPVTVGRP DLVPLTVSNT TLMSPGGTIS TSLSVRNDGA ASAPVSTVAA WLSVDTNLSG
DDILIGTAPA AALSPGASTT VGISGVLPPG IQSGTLNIIA VVDALGEVAE ADENNNRATG
QPLTIGTAEL SVTTVTMPAS IVRGSTASAT ATVANTGHYA ATGVRVGVYL SSDTAITTSD
MFLGSGVIAS LEPGASAPVS IPVPITDIVA AGTWYVGAVV DDLGMIAESD EGNNALAGNQ
VEILADGLDL TVQGVTAPAS GTTGQPVTIT ATVAATMPAA ASAVQFFVSR DPVITSADTY
LATKAVGSFG AAGAQTVTAT VTLPTTLTSD TWYLGAIADA YGVITEINET NNASAGRAIT
VNGPELVVES LTSASDTAYT AGTVSLASTI RSMAGAAPTH RVEFYLSTDP AITTSDIYLG
YRTASLPAGG SSTATTILTI PRYLTGGDYY IGAIADPGNV IAEANENDNS LGIPLHIIGP
DLQVDGLSLP GSALSGVPLT ISSRAFSTQG GSGSFTVDFY LSSDQTITTG DVYLGRRTVS
SLAVAGASTA TATVTIPNYV FTGRYYVGAI VDPYNYVKEE TETNNSTGTD QAVAVEVTGA
ELALASLSAP ASAKPGETIA VVNALATTAG SAPSSYMEFY LSTDSIITAA DRYLGGRTVS
ALPAGGANNA ATGLKVPADI LPGTYYLGAV SDPYNTVREA NEADNTRTVQ LTVTGRDLTV
EALSGPAAAL AGATIGVANA VKSAGGAVPG FDVTFYLSRD AVITRSDAYL GTRFVSGLDI
DGANTVTTTL KLPNDLEGGR YYLGAIVDGG NLIPETDESN NASAAAPIDL VGADLAVSAL
TAPATASAGE TISAQVIVTT RAGGSPYSLV NYYLSTDETV SPDDIYLGVS TIPSLGPGGG
ATVGKSVKLP ADMEPATYYL IAVADPANAV AEADETNNTS QPRAIAVTVP
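
The sequence blocks in this record are printed in space-separated groups of 10 characters, so they need to be normalized before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names (`normalize`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence contains only A, C, G and T.

```python
# Codon table built in the conventional TCAG order. Translation table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def normalize(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence block from this record.
first_line = "ATGAAACTGA GGCGATGTAT CGCGTCGCTG GCGACCGTGG GGATGCTCGC CATGCTGTTG"
dna = normalize(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKLRRCIASLATVGMLAMLL
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence block through `normalize` and `gc_fraction` would allow the same kind of check against the GC content field.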