Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0709 |
Symbol | |
ID | 2687315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 756029 |
End bp | 759223 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637125381 |
Product | hypothetical protein |
Protein accession | NP_951766 |
Protein GI | 39995815 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGC CCGAAACCGC CCCCCTGGCG CCGTTTCCCC CCGAAGCGGC CTACGAACCG TCCCTCGACC GGCTCATGCG CCTTGCCGCA TGGGCCGGCG CCATCCGGGG GGACGGCACG CCGGTGAACT TCACCTCCAT CCTCGTGGGG GCAATTCTCG TGGAGGACGA GACCGGCGCC TGGTTTACCC GCTACGCCCG GGAAACGGGA GAACGGGCCG AGGCTATCGC CGACCTGGTG CAGACATCCG AGGCGATGCG CCCAAGGGTT GAGGAGCAGG CAGCCGCCGG CACCCTCCCC GATGCCAGGG AGTTCTACTC CTCGTCGGTG CGCGACATCC TCGGGTCGGC GCTGCGGCTC GCCACCCGCG ACGGTGCCGA GCCCCGCCCC CTGGGGACGC GCCACCTGCT GGCGGCCCTC GCCTTCCGCC TCTCCGACTA CCACCGGCGC CAGCTCATCG GCTGGGGGGT CGACATCGCC GCCTGGCAGC GCAGCACCCT CGACTTCCTC CAGCGGGCCT TCCCGTCGGA GGATTCCACC TGGGTCTCCC TGGCCCTGGA GCAGGCCGAG GCGCCGCCCC CGGAACCTCC TCCCGCCGAC ACCATCTTCA AGAGTTTTGC CGCCGAGGCC CCCGTCCCGC CCGTCAGTGC CGGCCCCGAC GTCCTCATTG CCCGGTTCAC CGCCGACGAT CCCACCTCGG CCACGGACCT CATGGACGTC CGGCAGGAGG CCCGGGCCTT TGCCCGCCTC GCCGCGGGCC GCGCCATCCG CCCGCCCCTC TCCATCGGGG TATTCGGCGA ATGGGGGTCG GGCAAGACCT TCTTCATGAA GCTGATGCAC GAGCATGTGG CCCGGATTTC CCGGGATGCG GACAGTGACG GCCCCTTCCA CGGCAACATC GTCCAGATCC GCTTCAATGC CTGGCACTAC GTGGAATCCA ACCTGTGGGC CAGCCTGGTG GATTACATCT TCACCGAACT GGACCGCTGG CTCAAGGAGC GCCCCGAAAA CCCCAACGAG ACCGTGGATC TTCTCTTCGA CCACCTCTCC ACCTCCCGCA CCCTCAGGCT GGAGGCCATG GACGAACTGG TGCTCCGCCG CCGCCAGCGC CAGGAGGCCG AAGAAAACCT GAAAACCGTC CGGGCCCGGT TCGAGACGGC TCTGGCCCGC CAGGCCGTGG CAACGGCAGG CGAATTCTGG ACCGCCGTGG GCACGACCCT GGGCAAGGAC CGGGAGACGC GCGAGACCAT CGACCGCGCC GGCCGGGTCC TGGGAGTTCC GGAACTCTCC GCCTCGGCCC GCCAGTTGAG CGAAGTGCTC CAGGAAACCA CCACCCAGGC CGGACGGGCC CGGACCCTCG GGCGGAGCGC CATGGCCACG CTCGGCCGCC CCCGGTGGCT GGCGGCCCTG GCCCTGATCC TCGTGGCGGC GCCGGTCGCC GTGGTCTGGT TCCGGGACAT CCTGGGTCGG ACAGAGGTGC TCTCCTGGCT GAAGGAGGTG AACGCCGCCG TTCTGGGACT CTCCTCGGTC ATGGCCTCCG TGGCCGGGTT CGCGGGTACG GCCCTCAAGC GGACCGCCAC CGCCCTGGAC ACCCTCGAAG GCTTCCGCGC CAACCTGGAA ACGGCCATTG CCGAGCGGAC CGAGGAATTC AGAAAGAACA GCAATGAAGC GGCCCTGCTC ACCGATGCCG AGCGCGAGGC GACGCGGCTC AGGAGCGACG TGGAGCAGGC CGAGCGCCGC CTGGCCGATG CCGACCGCCG CGCCGCCGAC GCGGCCCGTG ACTTCCAGGG GGCCACGGCC CGGGGGCGGC TCAATGCCTT CATCCGCGAC AAGGTCGTCA GCGGGGACTA CGCCAAGCAC CTGGGTATCA TCGCCACCAT CCGCAAGGAT TTCGGCCAGC TGGCCCAGCT CATGGCCGAC GCCGATCGCG ACCGGGAGCT GGCCGAGGAA TATGAGCGGG CCCGGCTCGA CTACGTCCGC AGGGTGGAAG AACTCATCGC CCGCTCCGGC GACGTGCTGA CCGACGAGGA GGTCGCCGCC CTCCGGGCCT CCACCACCTT CGACGCCGAA TCCCTGCGCC TGTTCGAGCG GATCATCCTC TACATCGACG ACTTGGACCG CTGCCCGCCC GAAAAGGTCG TGGAGGTCCT CCAGGCCATC CACCTGCTCC TCTGCTTCCC CCTCTTCGTG GTCGTCGTGG CCGTGGACGC CCGCTGGGTC TCCCGCTCCC TCAAGGAGGT CTACCCGGAG CTCCTGGCCG AAACCGTCAT CATCCCCGGA ACCGGCGCCA CGGCACCCGC CCGCCCCGGC GCCCCCGCCG GCGAAGCACG GCATGATGCC ACCGACCGGC GCACCGCTCC CAACAACTCA GACCGGGAGC GGGCCGCCTC GTCCCAGGAT TACCTGGAGA AGATCTTCCA GCTTCCCTAC TGGGTGCGCG CCATGGATGC CGACGCCTGC CGGAACTATA TCAAGGGGAT CGTGGCAGCC GAGTCGACAG TCCAGGCCGA CCAAGCACCC CTGTCGCCCG AACCGGCTCC GAACGCCGCC CCGCCGGCGC CTTCGCTCCA ACCGCCGACA GAGCGGGCGG GCGGAGCGAC TGAAGTAGCG CCCTCAGAGG CACCGGTCGC GTCCCCGCCC CCCGCTGCGC CGTCTCAGCC CGACAGCGGC GGGGAGTCCT TCCGCGAGCA CGAGGCCCGC ACCATGACCC TCACCCCCCA TGAAACCGCC TTCATGGCCG AACTTGCCCC CCATGCCGGG GGCACCCCGC GTCGGGGACT GCGCTTCGTC AACGTCTACC GGTTGATCCG CACCAGCCTG CCGCTCCACG AGCACGAGAC CCTGGTGGGC GACAAGGGGG AACAGACAGC CTACCGGGCT CTCCTGACCC AGTTGGCCAT TGTCACCGGG GCGCCGGACA TCGCCCCCGT CTACTTCGAC CACTTGGCCA TCCTCGCTGC CGGCAACCTC GCCGAACCCA AGGAGCGCAA GGGGCTGGTC GACCTGATTG CCGCCCTGGG CGAAGACGCC CGCGTCACCG CTAGCACCGA AGCCGCACCA CTCCTGGGTG CGCTCAAGGC CTTGCACAAC AGCGTCGCGC CCCAGGGGCT CGGCAACGAG CCAACGCTCC TGGCCACTCT CCACGACACC TCCTCCGTGG CCCGGCGTTA TTCCTTCACC GCACGACCCC ACTGA
|
Protein sequence | MDKPETAPLA PFPPEAAYEP SLDRLMRLAA WAGAIRGDGT PVNFTSILVG AILVEDETGA WFTRYARETG ERAEAIADLV QTSEAMRPRV EEQAAAGTLP DAREFYSSSV RDILGSALRL ATRDGAEPRP LGTRHLLAAL AFRLSDYHRR QLIGWGVDIA AWQRSTLDFL QRAFPSEDST WVSLALEQAE APPPEPPPAD TIFKSFAAEA PVPPVSAGPD VLIARFTADD PTSATDLMDV RQEARAFARL AAGRAIRPPL SIGVFGEWGS GKTFFMKLMH EHVARISRDA DSDGPFHGNI VQIRFNAWHY VESNLWASLV DYIFTELDRW LKERPENPNE TVDLLFDHLS TSRTLRLEAM DELVLRRRQR QEAEENLKTV RARFETALAR QAVATAGEFW TAVGTTLGKD RETRETIDRA GRVLGVPELS ASARQLSEVL QETTTQAGRA RTLGRSAMAT LGRPRWLAAL ALILVAAPVA VVWFRDILGR TEVLSWLKEV NAAVLGLSSV MASVAGFAGT ALKRTATALD TLEGFRANLE TAIAERTEEF RKNSNEAALL TDAEREATRL RSDVEQAERR LADADRRAAD AARDFQGATA RGRLNAFIRD KVVSGDYAKH LGIIATIRKD FGQLAQLMAD ADRDRELAEE YERARLDYVR RVEELIARSG DVLTDEEVAA LRASTTFDAE SLRLFERIIL YIDDLDRCPP EKVVEVLQAI HLLLCFPLFV VVVAVDARWV SRSLKEVYPE LLAETVIIPG TGATAPARPG APAGEARHDA TDRRTAPNNS DRERAASSQD YLEKIFQLPY WVRAMDADAC RNYIKGIVAA ESTVQADQAP LSPEPAPNAA PPAPSLQPPT ERAGGATEVA PSEAPVASPP PAAPSQPDSG GESFREHEAR TMTLTPHETA FMAELAPHAG GTPRRGLRFV NVYRLIRTSL PLHEHETLVG DKGEQTAYRA LLTQLAIVTG APDIAPVYFD HLAILAAGNL AEPKERKGLV DLIAALGEDA RVTASTEAAP LLGALKALHN SVAPQGLGNE PTLLATLHDT SSVARRYSFT ARPH
|
| |