Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3564 |
Symbol | |
ID | 8138936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4137082 |
End bp | 4139949 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871183 |
Product | cytochrome C family protein |
Protein accession | YP_003023343 |
Protein GI | 253702154 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 126 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACCTA AGTATGGAGG ACGATGCTTA TTTGTGCTGG CGCTTCTGGT GCTGACACTG ATGCAGCTTA CCGCGGGACA GGCACAAGCC GCCTCGCAGT ACAGCTATGA CTGTACCTTC TGTCACCAGA TGCCGCCGCT GGATTCGGCT AACGCGAAGA AGGACCCCAA CACGGGCGCC ATTCCCGGCA ACCACCAGGG CCACGCTTCG GCAGCTGTCA ACTCCTGCGC CAAGTGCCAC GGCGACGCGA GCGGCTACGC CATGGGGCAC AGGAACAAGA CCATCGAGCT CGCCGACGGC ATCGGCTACT CCAGGAAGAT CGCGGCCGGG TTCGTGAACC AGACCTCGGT TCCGCCGAAC CCGATGGGAA CCTGCTCCAC CGCGGCCTGC CACAGTAACG GCAAGGGCGT GTTGAAGACG ACCCCGGCCT GGGGCGCCGC ACCGTTCCAG GCTCCCGGCG ACTGCTCCCA GTGCCACGAC GTCGCTCCCG CCTCCGGCAA CCACCCGACG CTGGGCAGCA AGCACGCCGC CTACTTCGGC ACCGGTACCG GCTCCTGCGT CAAGTGCCAT ACCGACCACA CCGCGCAGGC CAAGCCGTTC AGCCACGCGA CCAGCGCCGG CCGCGCCATC GAGGTGACCT TCGCCGACGG CGGCTCCTTC GCGGCCAACC AGTGCTCCAA CGTCTCCTGC CACAGCAACG GCCAGGGAAG CTTCACCCCG CCGACCTGGG GTGCCACCCT TGACTGCGCC GGCTGCCACG GCACCGCGAC CAGCGACACC CTCTCCGGCA ACCACGCCAA ACACGTGAAC AACGCCGGCG TGCTCGGCAC CAGCTACGGC TGCGTCGAGT GCCACAGCTC CACCGTGACC GACAACAGCA CCATCGGCAA CTTCGCGAAC CACGTCAACA AGAGTAAGGA AGTGGCGGGC GCCCACGTAG GGACCCCGGT CGCCGGCTCC TGCTCCACCT CCTACTGCCA CAGCGACGGC AAGGGGACCC AGAAGAGCGT GACCTGGACC CAGACCGAAA CCCTGGACTG CAAAGGGTGC CACGGCTCCG CAGCACCGGC TTCCTTCGCC TCCATCGCCG GCGAGCCTAA CTACGCAAAC GGCGGCGTCG ACCAGCCGCG CGCCAACAGC CACGAAAACC ACGTCTTCGG GGCCGCCAGC TGCCAGAACT GCCATAGCAG CACCACCGTC GACGGCCTCA CCATCAAGGC AGGCGCTCCG CACACCGACG GCACCCGCAA CGTGGCAGCC GGCAACGGCA AGAGCTTCAC CATCGCAGCG AACTCCTGCT CCGCGGTATC CTGCCATGAC GGCGGCGGCA TCTTAACAGG CGTCGGGGCT GTCAAATGGG GCGGGACCCT CGGCTGCGAC GGCTGCCACG GCGACGTCGA AACGCTCGCC ACCAACGCCC ATACGGCGCA CGTTGCCACC AAGGGCTATG CCTGCGACAC CTGCCACGAG CAGACCGTTT CGGGGAGCTA CAGCTTCGTC AACAAGGCGC TGCACGGCGA CTCAATCGTC GAGGTTTCCG GCGCCAGGTT GAACAGCTTC GCCACCGACA CCAAGAACTG CGCCACTTCC TGCCACCTTA CCGGCACCCC GAAGTGGACC GAGACCGCAT CCGGCGCTTG CGGTACCTGC CATAAGGCGC TTTCCACCAC GGTGAACGGC CTTGTTTCCA GCAACGCCCA CTCCGCCCAC TTCACGGCGA CCTACGGCCC GGGCATGAAC GGCGCGGCGG CCACCTCCTG CGCCGGCTGC CATACCCCGA ACACCGCGGC AAGCCACGCC GACGGCACGC TCAACCTCGC CATCGGCTAC AACAAGATCG GCACCTGCTC CAGCTGCCAT AAGCAGAACA CCACCTGGAC CGGCGGGCGC GTTTCCTGCG AAAGCTGCCA CAGCACCGCA GGGGGCGAGC TCTCCGTCAT CGGCGCCCTC ACCGCTCCGG ACAAGACCCT GGCTGCGACC GCGGGTCACG GCAAGGCGGG CGTCGACCAG TCCTGCTCCG CTTGCCACGA CGCCAACTCC GCCCACATCA ACGGCGTAGC CGGCGACAAC AAGCGTCTCC TTGGTGCCTT GACCGGCGCC GACAACCAGG AGTGTAACTA CTGCCACACC GATCCGGCGA AGACCACCGG GTTCACGCTC GGCGTGAAAG TCCACCAGGC TTCCGGCCTG GGCGCCAAGT GCGCCGACTG CCACAACGCT CACGGTACCG CCAACAGCAT GATGGTTAAC GGCACGATCA ACGGGACCAA CGTCAGCTTC ACCGGCAACA GCACCTTCGC CAACGGCGCC AACACCGGCG TCTGCCAGGT CTGCCACACG GCGACCGATT ACTTCAAGAA AGACGGCACC GGCGCAACCC ACGTGGAGTC CACCACGAAC TGCTTGAACT GCCACGCTCA CAACCCGTCC ACCGGCCTCG CCTTCATGGC CAACGGCGCT TGCGACGCCT GCCACGGCTA CCCGCCGGCT CCGAGGCAGA CCATCTCCGC GGTCACCTTC GGCGTCATGG GCAACTGGTC TTCCGCCCGC TTCGAGGACT ACTCCGGCGG CGGCGGTGCC CACGTGGTCG CGGCGCACAT CAAGAAGGAC GCCAACCCCT CCGAGGGTTG GGCAAACTGC ATCCCTTGCC ACTTCGAGGG TCAGGCCGGC CACAACAGGG CGCTTCCGGT CAGGAACTTC GTCGAGAACG TGACCGTGAA ACTCGACCCG CAGTACCGCT TCAGCAACGA AGTCATGGCA ACCTACACCT CCGCCCAGCT TGTTTCCGGC GGGGCCAACA AGTCCGGCAG CTGCTTCAAC GTGAGCTGCC ACTTCACCAA GACCAGGCCG TGGAGTATCG AAAGGTAG
|
Protein sequence | MLPKYGGRCL FVLALLVLTL MQLTAGQAQA ASQYSYDCTF CHQMPPLDSA NAKKDPNTGA IPGNHQGHAS AAVNSCAKCH GDASGYAMGH RNKTIELADG IGYSRKIAAG FVNQTSVPPN PMGTCSTAAC HSNGKGVLKT TPAWGAAPFQ APGDCSQCHD VAPASGNHPT LGSKHAAYFG TGTGSCVKCH TDHTAQAKPF SHATSAGRAI EVTFADGGSF AANQCSNVSC HSNGQGSFTP PTWGATLDCA GCHGTATSDT LSGNHAKHVN NAGVLGTSYG CVECHSSTVT DNSTIGNFAN HVNKSKEVAG AHVGTPVAGS CSTSYCHSDG KGTQKSVTWT QTETLDCKGC HGSAAPASFA SIAGEPNYAN GGVDQPRANS HENHVFGAAS CQNCHSSTTV DGLTIKAGAP HTDGTRNVAA GNGKSFTIAA NSCSAVSCHD GGGILTGVGA VKWGGTLGCD GCHGDVETLA TNAHTAHVAT KGYACDTCHE QTVSGSYSFV NKALHGDSIV EVSGARLNSF ATDTKNCATS CHLTGTPKWT ETASGACGTC HKALSTTVNG LVSSNAHSAH FTATYGPGMN GAAATSCAGC HTPNTAASHA DGTLNLAIGY NKIGTCSSCH KQNTTWTGGR VSCESCHSTA GGELSVIGAL TAPDKTLAAT AGHGKAGVDQ SCSACHDANS AHINGVAGDN KRLLGALTGA DNQECNYCHT DPAKTTGFTL GVKVHQASGL GAKCADCHNA HGTANSMMVN GTINGTNVSF TGNSTFANGA NTGVCQVCHT ATDYFKKDGT GATHVESTTN CLNCHAHNPS TGLAFMANGA CDACHGYPPA PRQTISAVTF GVMGNWSSAR FEDYSGGGGA HVVAAHIKKD ANPSEGWANC IPCHFEGQAG HNRALPVRNF VENVTVKLDP QYRFSNEVMA TYTSAQLVSG GANKSGSCFN VSCHFTKTRP WSIER
|
| |