Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2912 |
Symbol | |
ID | 2688563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3206802 |
End bp | 3209942 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637127605 |
Product | cytochrome c family protein |
Protein accession | NP_953954 |
Protein GI | 39998003 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain [TIGR01905] doubled CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.802585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGAA TACTACAGAG GACTTTCGTC ACGGCATTAG TGACGCTCTT CGTCCCGATC GCCGCGTGGG CCATCGACTA TCCCCACGAA CTCGGCAGCG TCAAGGGCTA CAAGTGTTCC TCCTGCCATA CCGTTCACAG CACCCTCGGT TCCACGGGGT ACAACAACGT CTGCCTCACC TGCCACAACC CGAGCGATCC CTACGGCTCA GGCAAGCCCT TTGCCATGAC CGACTTCGCC AACCCCTTCC GCACCTGGAC GAGTGCCCGG CCGGCGGTGA CGTACCAGAC CTCCCACAAC TGGATCGGCA AGGACGTGGT GCCCAAGGCG GGCGCGGTGG CTCCCACGGA CGTCAGGCTC ACCAAGTCGC TCCTCATCGG AACCGTGTCC TGCGCCCGCT GCCACAACAT CCACGACCTC TATTCATCGG CCTACAACTC GAAACCGTTC CTCCGGGTGC GCAACGACGA GGACCAGCTC TGCCTCGACT GCCACCGGCC CCGCAAGACC GTTGACCACA CCAGGGGATC CCACCCCGTC ACGGTGAACT ACTCGACCAA GGCCGCGGCC CGGCCCGACG AATTCTACAG CCCGCCGCGC AACAGCAACC CGGCCAACCC CACATCGGCC ATGAGGATGT CGCGCAGCGG CGCGGTCGTC TGCACCACCT GTCACGGGGT GCACTACACC GACTCCAACA GCCGCACCTT CGACAATGCA TCAAGCGCCC GGATGGGGCT CCTTTCCACG TCCCGGGGCA TGCTGCTGCG CACGGACCTG AAGGGAAGGA CCGTCAGCGA TCCCAACATC TGCACCAACT GCCACAAGTC GGCCGACGAC CCGGCAAACA CCACTGCCCG CGTCAAGAAC CACAACGGCA CCAAGAACCA GAACGTCCAG TGCGCCGACT GCCACGGCGG CCACGTGGAC GAGGCTGACG GGACCGCCCC CAACGCCTAT CTCATCAACC GGTACATGAA CATCTCCACC CAGTACGGAG CGGTCCGCAA CGCCAAGGTC ATGTACCAGT ACACCTCGGT TACCCGGAAA AACTGGAACA AGGATGCCTT CGGCGTCTGC CTGGCCTGCC ACTCGCCCCT GCCGGGAACC ATCGGGCAGC ACTCGAGCAC CAACGCAGCC GACTGCCGGA GCTGCCACAC CCACTCCCAG GGCTTTTCGG CCAACTGTAC CCAGTGCCAC GGCTTCCCCC CCACCGTCAA CACGGCCGGC GGGCCTTCGG GCTACGGCAA GGACGGCGTT CGTGACTATT CAACCTCCGG TGTGTTCAAG AACGAAACCC AGACCCCCCA CGCCCGGCAT GCCGGCGGCG GTGCCAACTA CAGCATAGCG TGCGACCAGT GCCACAAGGG CTTCAGTCAC AATACCGGCA CGTTCCAGGA TGTGTTCGTC GACAAGACCG GCATCATCGC CAGTATCGGC TCCGGCGCCA ACCCCACCTA CAACCCGGCC GGCAACGGCA CCTGTACGGC GACCTACTGC CACTCGGACG GCGCTCCCCG CAACGCTTCG CTGCAGGCCG TGGTCGGCGC GCCGGGCAAT ACGACGTTCA CGTGGGGCAA CGGCGTGGGC AAGCTGTCGG GCTGCAGTTC GTGCCACGCG GCATTTCCCC TGACCAACGC CCACCCGGCC CACTTCAACG CAGGGATCAT GAACTGCCAG AACTGCCACT ACGCGACGCT CAGCGGCCCC TCCACCATCA AGGACAAATC GGTCCACGTG GATGGCGTGA AAACCGTCGT CTTCAACGGC GTCGCCCGGG GAACCATCAA CGTCGGCAGC GGTACCTACA ATGTGAACAC GGCGACCTGC AGCGTTGCCG GATGCCACGG CGACGGCCGC GGCGGCGCGC CGGTGACGAC TCCCCAGTGG GTCGATCCGA CGACCGGCGC GTGCGGTAAG TGCCACTACG CCACCCCCAC CATCGCCGCC ACGAGCGCCC AGACCATCGG GACCAACGGC CACGCGGCCC ACCTGACCCT CGGCTATGGT CCCAAGGCCA TCCTGGGGGC AACGGTATCC GCCTGCCAGA GCTGCCACAC CTACACGACC TCAACGGCGA TCACCCACGT GAACGGCTCC GTCCAGGTGG TTTCGGCCAA CTGTACGTCC AGTTGCCACA AGAACGGCGC CACCTGGACC TCGGGCCGCG TGACCTGCGA AAGTTGTCAC ACGGGGCTCC TCTCGGTCAT CGGCGGCAAG ACCGCACCGG CAAAAACGAA TTTCACCGCC AGCGGCCACG GCCAGGCCGG GGCCGGCTTC AACGCCAGCC GCCAGTGCGC AAGCTGTCAC GATGCCGACA GCGGGCACAT CAACGGCACT TCGGGCGACC AGAAGCGGAT CGCCGTGAAC GACTACACCC TCTGCGCAGG CTGCCACAAT GACGCCATCA AGGTCCCCAC CGCCACCAGG CGCAACGTGA CCCGCCATGC CGTGGACCTG GGCAACTACA CCATGGAGTG CAAGACCTGC CATGACGTCC ACGGCACCGG CAACCGGGCC ATGGTCCGGA CCACCCTTGT TTTCGGGGCA CTGACCTCCA CCATCAGCTA TGCCACCACG GCCGATCTCG TGCAGCTACA GGCTCCGTAC CGCGGCGTAT GCCAGACCTG CCACACCAAA ACGAGCCATT ATCGGCGCGG CGTCAACGAA GGATCCGGCC ACCCCACAAC CGGCTGCCTG AACTGCCACT CCCACCGCAA CACGTTTGCC TTCAAGCCTA AGGCTTGCGA CGAGTGTCAC GGCTACCCGC CGACACCCAA GGGCTTCGTT GCCAGCCAGG CCAACTACTC GACGGCACGT CTCGAGAACT ACTCGGGCGG CGGCGGCGCC CACGTGAAGC TGGGTCACCT CATGTCGAAC CTGCGGCCGT CCCAGGGCTT CACCCCATGC CTCACCTGCC ACTACGACGG TGCCGCGGCC CACGTGGGCG ACGAGTCGGT CTGGGCCGGC GGCAGCACCC AGCAGAAGAA GTCGGCAGTG AACGTGAAGA TCGATCCGAC GTACAAGTTC AACGCTGATA AGGGACAGTG GTACTCGAAG CAGTCCCCGG ATGCCACCGG CAGTTGCTGG AACGTCAGTT GCCACTTCCA GCCGACGCCG CGCTGGTCCG ACGACAAGTA G
|
Protein sequence | MNGILQRTFV TALVTLFVPI AAWAIDYPHE LGSVKGYKCS SCHTVHSTLG STGYNNVCLT CHNPSDPYGS GKPFAMTDFA NPFRTWTSAR PAVTYQTSHN WIGKDVVPKA GAVAPTDVRL TKSLLIGTVS CARCHNIHDL YSSAYNSKPF LRVRNDEDQL CLDCHRPRKT VDHTRGSHPV TVNYSTKAAA RPDEFYSPPR NSNPANPTSA MRMSRSGAVV CTTCHGVHYT DSNSRTFDNA SSARMGLLST SRGMLLRTDL KGRTVSDPNI CTNCHKSADD PANTTARVKN HNGTKNQNVQ CADCHGGHVD EADGTAPNAY LINRYMNIST QYGAVRNAKV MYQYTSVTRK NWNKDAFGVC LACHSPLPGT IGQHSSTNAA DCRSCHTHSQ GFSANCTQCH GFPPTVNTAG GPSGYGKDGV RDYSTSGVFK NETQTPHARH AGGGANYSIA CDQCHKGFSH NTGTFQDVFV DKTGIIASIG SGANPTYNPA GNGTCTATYC HSDGAPRNAS LQAVVGAPGN TTFTWGNGVG KLSGCSSCHA AFPLTNAHPA HFNAGIMNCQ NCHYATLSGP STIKDKSVHV DGVKTVVFNG VARGTINVGS GTYNVNTATC SVAGCHGDGR GGAPVTTPQW VDPTTGACGK CHYATPTIAA TSAQTIGTNG HAAHLTLGYG PKAILGATVS ACQSCHTYTT STAITHVNGS VQVVSANCTS SCHKNGATWT SGRVTCESCH TGLLSVIGGK TAPAKTNFTA SGHGQAGAGF NASRQCASCH DADSGHINGT SGDQKRIAVN DYTLCAGCHN DAIKVPTATR RNVTRHAVDL GNYTMECKTC HDVHGTGNRA MVRTTLVFGA LTSTISYATT ADLVQLQAPY RGVCQTCHTK TSHYRRGVNE GSGHPTTGCL NCHSHRNTFA FKPKACDECH GYPPTPKGFV ASQANYSTAR LENYSGGGGA HVKLGHLMSN LRPSQGFTPC LTCHYDGAAA HVGDESVWAG GSTQQKKSAV NVKIDPTYKF NADKGQWYSK QSPDATGSCW NVSCHFQPTP RWSDDK
|
| |