Gene Information |
Locus tag | GM21_3543 |
Symbol | |
ID | 8138915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4099593 |
End bp | 4102466 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871162 |
Product | cytochrome C family protein |
Protein accession | YP_003023322 |
Protein GI | 253702133 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain |
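The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon (the record marks the gene as unspliced). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4099593
end_bp = 4102466
gene_length_bp = 2874
protein_length_aa = 957


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "2874 957"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.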
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 142 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGTTTA GGTATGGAGG ACGATGTTTG TTTGTGCTGG CACTTCTGGT GCTGACACTG ATGCAGCTTA CCGCGGGACA GGCACAAGCC GCCTCGCAGT ACAGCTATGA CTGTACCTTC TGTCACCAGA TGCCGCCGCT GGATTCGGCT AACGCGAAGA AGGACCCCAA CACGGGCGCC ATTCCCGGCA ACCACCAGGG CCACGCTTCG GCAGCGGTCA ACTCCTGCGC CAAGTGCCAC GGCGACGCGA GCGGCTACGC CATGGGGCAC AGGAACAAGA CCATCGAGCT CGCCGACGGC ATCGGCTACT CCAGGAAGAT CGCGGCCGGG TTCGTGAACC AGACCTCGGT TCCGCCGAAC CCGATGGGAA CCTGCTCCAC CGCGGCCTGC CACAGTAACG GCAAGGGCGT GTTGAAGACG ACCCCGGCCT GGGGCGCCGC ACCGTTCCAG GCTCCCGGCG ACTGCTCCCA GTGCCACGAC GTCGCTCCCG CCTCCGGCAA CCACCCGACG CTGGGCAGCA AGCACGCCGC CTACTTCGGC ACCGGTACCG GCTCCTGCGT CAAGTGCCAT ACCGACCACA CCGCGCAGGC CAAGCCGTTC AGCCACGCGA CCAGCGCCGG CCGCGCCATC GAGGTGACCT TCGCCGACGG CGGCTCCTTC GCGGCCAACC AGTGCTCCAA CGTCTCCTGC CACAGCAACG GCCAGGGGAG CTTCACCCCG CCGACCTGGG GCGCCACCCT TGACTGCGCC GGCTGCCACG GCACCGCGAC CAGCGACACC CTCTCCGGCA AGCACGGCAA ACACGTGAAC AACGCCGGCT TCCTCGGCAC CAACTACGGC TGCGTGGAGT GCCACTCCGC CACCGTGATC GACAACAGCA CCATCGGCAA CTTCGGGAAC CACGTCAACA AGACCGCCGA CGTCGCGGGC GCCCACGTAG GGACCCCGGT CGCCGGAAGC TGCTCCACCT CTTACTGCCA CAGCGACGGC AAGGGGACCC AGAAGAGCGT GACCTGGACC CAGACCGAAG TTCTTACCTG CAAAAGCTGC CACGGCTCCG ATGCGGCTCC CGCCTTCGCC TCGGTCGCCG GCGAGCCTAA CTACGTGAAC GCCGGCCCCG ACCAGCCGCG CGCCAACAGC CACCAAAACC ACGTCGTCGG GGCTGCGAAC TGCCAGAACT GCCATAGCAG CACCACCGTG GACGGCCTCA CCATCAAGGC CGGCACTGCG CACACCGACG GCACCCGCAC CGTCGCAGCC GGCAACGGCA AGAGCTTCGT CATCGCAGCG AACAGCTGCT CCGCGGTATC CTGCCACGAC GGCGGCGGCA TCGTAGCAGG CGTCGGCGCT GCCAAATGGG GCCAGTCGCT TGGTTGCGCC GGTTGCCACG GCGACGCGGC AACCCTTACC ACCAACGCGC ACGCGGCCCA CGTTTCCACC AAGGGCTATG CCTGCGACAC CTGCCACGCC CAGACCGTTA CCGGCAGCAC CAACTTCGCG AACAAGGCGC TGCACGGCGA CGCCATCGTA GAGGTTGCCG GCGTCAAGGT AACGAGCTTC GCCACCGACA CCAAGAACTG CGCCACTTCC TGCCACCTTT CCGGCGCACC TAAGTGGAAC GACACCGCTT CCGGCGCTTG CGGCACCTGC CATAACGCGC TTTCCACCAC CGTTAACGGC CTTGTTTCCA GCAACGCCCA CTCCGCCCAC TTCACGGCTA CCTACGGCCC GGGCATGAAC GGCGCGGCGG CCACCTCCTG CGCCGGCTGC CATACCCCCA ACACCGCGGC GAGCCACGCA 
GACGGCACGC TCAACCTCGC CATCGGCTAC AACAAGATCG GCACCTGCTC CAGCTGCCAC AAGCAGAACA CCAACTGGAC CACCGGGCGC GTTTCCTGCG AAAGCTGCCA CAGCACCGCA GGGGGCGAGC TCTCCGTCAT CGGCGCCCTC GCCGCTCCGG ACAAGACCGG TGCTGCGACC GCGGGTCACG GCAAGGCGGG CATCGACCAG TCCTGCTCCG CTTGCCACGA CGCCAACTCC GCTCACATCA ACGGCGTAGC CGGCGACAAC AAGCGTCTGC TTGGTGCCTT GACCGGCGCC GACAACCAGG AGTGTAACTA CTGCCATGCC GACGCCGGCA AGGTCGCCGG CGCCGCTCTC AACGTGAAAG CCCACCGCCT CACCGGCGAA CTCGGCGCGA AGTGCTCCGA CTGCCACAAC GCGCACGGTA CCGCCAACAG CATGATGGTT AACGGCACGA TCAACGGGAC CGAGGTGAGC TTCACCACCG TCGACAGCTT CGCCAACGGA GCACGCACCG GCGTCTGCCA GGTCTGCCAC ACCACGACCC AGTACTTCAC CAAGGCCGGC CAGCCGGAGG CGACCCACGT CGACTCCACC GCGAACTGCC TCGAGTGCCA CCAGCACAAC CCGGCAACCG GCCTCGCCTT CGTTCCCAAC GGCGGATGCG ACGCCTGCCA CGGCTACCCG CCGGCCCCGA GGCAGACCAT CTCCGCGGTC ACCTTCGGCG TCATGGGCAA CTGGTCCACC GCCCGCTTCG AAGACTACTC CGGCGGCGGC GGTGCCCACC TGGTCGCAGC GCACATCAAG AAAGACGCCA AGCCGTCCGA GGGTTGGGCA AACTGCCTCC CCTGCCACAA CGGCGGCAAC GAAGCGCACG CAAGGGCTCT GCCGATCAGG AACCACGTGG ACAGCGTCAC CGTCCAGATC GATCCGCAGT ACCGCTTCAG CGACGATCCG TTCCTGGGCT ACACCTCCGC CCAGCTGGTC AACGGCGGGG TCAACAAGTC CGGAAGCTGC TTCAACGTGA GCTGCCACTT CAAGAAGACC AAGCCGTGGA GTATCGAAAG GTAG
Protein sequence | MLFRYGGRCL FVLALLVLTL MQLTAGQAQA ASQYSYDCTF CHQMPPLDSA NAKKDPNTGA IPGNHQGHAS AAVNSCAKCH GDASGYAMGH RNKTIELADG IGYSRKIAAG FVNQTSVPPN PMGTCSTAAC HSNGKGVLKT TPAWGAAPFQ APGDCSQCHD VAPASGNHPT LGSKHAAYFG TGTGSCVKCH TDHTAQAKPF SHATSAGRAI EVTFADGGSF AANQCSNVSC HSNGQGSFTP PTWGATLDCA GCHGTATSDT LSGKHGKHVN NAGFLGTNYG CVECHSATVI DNSTIGNFGN HVNKTADVAG AHVGTPVAGS CSTSYCHSDG KGTQKSVTWT QTEVLTCKSC HGSDAAPAFA SVAGEPNYVN AGPDQPRANS HQNHVVGAAN CQNCHSSTTV DGLTIKAGTA HTDGTRTVAA GNGKSFVIAA NSCSAVSCHD GGGIVAGVGA AKWGQSLGCA GCHGDAATLT TNAHAAHVST KGYACDTCHA QTVTGSTNFA NKALHGDAIV EVAGVKVTSF ATDTKNCATS CHLSGAPKWN DTASGACGTC HNALSTTVNG LVSSNAHSAH FTATYGPGMN GAAATSCAGC HTPNTAASHA DGTLNLAIGY NKIGTCSSCH KQNTNWTTGR VSCESCHSTA GGELSVIGAL AAPDKTGAAT AGHGKAGIDQ SCSACHDANS AHINGVAGDN KRLLGALTGA DNQECNYCHA DAGKVAGAAL NVKAHRLTGE LGAKCSDCHN AHGTANSMMV NGTINGTEVS FTTVDSFANG ARTGVCQVCH TTTQYFTKAG QPEATHVDST ANCLECHQHN PATGLAFVPN GGCDACHGYP PAPRQTISAV TFGVMGNWST ARFEDYSGGG GAHLVAAHIK KDAKPSEGWA NCLPCHNGGN EAHARALPIR NHVDSVTVQI DPQYRFSDDP FLGYTSAQLV NGGVNKSGSC FNVSCHFKKT KPWSIER
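The TIGRFAM entry for this gene names a CXXCH pattern, a motif commonly associated with heme attachment in c-type cytochromes. As a minimal sketch, the snippet below locates CXXCH occurrences in the first 80 residues of the protein sequence above. The helper name `find_cxxch` is hypothetical, and the positions it reports are 1-based. It does only a plain pattern scan and does not reproduce the TIGRFAM model.

```python
import re

# Lookahead so that overlapping occurrences would also be reported.
CXXCH = re.compile(r"(?=(C..CH))")


def find_cxxch(protein):
    """Return (1-based position, matched residues) for each CXXCH occurrence."""
    seq = protein.replace(" ", "").upper()  # drop the 10-residue block spacing
    return [(m.start() + 1, m.group(1)) for m in CXXCH.finditer(seq)]


# First 80 residues, copied from the Protein sequence field of this record.
n_terminus = (
    "MLFRYGGRCL FVLALLVLTL MQLTAGQAQA ASQYSYDCTF "
    "CHQMPPLDSA NAKKDPNTGA IPGNHQGHAS AAVNSCAKCH"
)

hits = find_cxxch(n_terminus)
print(hits)  # prints [(38, 'CTFCH'), (76, 'CAKCH')]
```

Passing the full 957-residue sequence to the same function lists every CXXCH site in the protein.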