Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2216 |
Symbol | |
ID | 8137552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2584980 |
End bp | 2586851 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644869829 |
Product | cytochrome c family protein |
Protein accession | YP_003022024 |
Protein GI | 253700835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.000000000696852 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGC TGCTCACGAT GTTGATGTTG CTGTTGACGG CGCAGGCTTG GGCATTCGAT GCCAAATCGC AGTGCGTAAT CTGCCATGGA GACAAGGGTA AGATGGAGTC TCTTGGCGCC GCGTCGATGT ATCTGGATCC GGCCCAGGTT GATCGTGAAG TGGGTATGGA CGGCGCCACT TGTGTCGATT GTCACCTGGG GGACCCTTCC CAGCCGTCGA AGGAAGCATC CCACAAGGAC ATGCTGCGTC CTTTCGTGGT CGGGGTGGGG CCAAAGGTCA AGGGGCAGGC GGTTTCCCGC GCCGATGCAG GGGCTCTGAA ACCCATCGTT CCTGACGGAG ACGGCATGGA CCGGATGCTA CCGGAGGGTG ACCCCAAGAA GCTGGAGGGC CTCGGGGTAA AGATACTGAC GGGAATCGAG TGGCACGACC GCGATCCCGA AACGCTCGCG TATGCTCCCA AGGTTGCCGA GCAGACCTGC GGCAGGTGTC ATGCAAAAGA GGTGAAGGAT TACAACAGTT CCGCCAAGGG GCTGTTGAAA CATCAGCGCG CCTATCGGAA GTGGGCCGAG ACCCTTCCGG GACCGCAGAA CTGCGGCATG TGGTTCGGGC AGAACTACGA GAATCTGAAG AGCCAGACTT CGGTACCTTT CAGCGCCGCA CAAAATGCTG CCACGGATCG CAGCTGCAAC ACGTGCCATC CCGGTTGTAA TGACTGCCAC TACAAGCCCT TCACAGGCAA GGGGCGGCAC TCATACGGCA AACCGGACAC CGATAGCTGC TACGGCGGAG GGAGAGCGAG CATCTGTCAT GCGGGACCCA TGGACCGGAG GCGCGGCGCG GGATACGTCC GCGGCGAATA CGCCTTTCCG AGCAACCTGC CGCGAGGCGC CCACGTAAAG GCCGGCGTAC AGTGCCTTGA TTGCCACAAG CCTGTGAACC ATCAATTCGG CCATCTCGCC GCCGACGACG CGAGGGGGGC TTGCGCCAAG TGCCACGCCG ACATCGTGAA AGCCGTGCAG ACTTCGGCGC ACAAAAAGGT TGATTGCGGC GCCTGCCACA TAACCGTCTC CGGCGCCTAC CAGTACACCT TCTGGGGTAA GGGAAACTTT GCCGGCGTGG AAACGCCTTA CGGAAAACAT AAGGAATACT ACGGCATTCG CGACCTCCCG ACCCTGATCA AGAACGCCTC CGGCCGCTGG ATTCCGGTTA AGCCGTATCC TATGGCGGTG CTGAACCAGA CAATGGAAGT AGGCCCCACC GGGCTTCTGT TCCGCTCGAT CCCAAAGAGA AGCGTTCCAG GCAACGTCAG GATAGGTGAG CCTCCCGCAT TCGAAGTCTC CCGTGCCGCC ACCGATGTCA ATGACGCCTT CATCATCGTC GGCACCCGTA ACGATCTACC CTCCGGCAAC AAGGCGATCC TTTGGGTGCA GATGGACAAG CTAAGCCATG CCCTGGGTAA GCCGAGAGGA TGCGCGACCT GTCATGACTC CCACGCGCAG GTCGGAAAGT CCGAGTGGAG CTATTTCGAA TCAAAGGACG TAACCAAACG GTTCAAAGGG AGCTACACGG TGACGGCGGA CAAGAACGGG ATCAGGTTCA GCGACGCAGT GTGGGAGACC CCGATCATGG CAGCTAATCG GAAGGTCGAG GACATAGCGC CGTTTGCCGT GCTGCCCAAA GACGCCTGGG ATGTGAAGCG GATAAACCTC TCCATCCCCT TCGACGAGAA GAAGACGGGG AAGGAGAGGG GAGAGCTCGA CAAGTTCCTG GCCGAGCTTG GCAAGCGGAA GGGAGGCGAT GAACTGCGAA AGATAAGGGT GATCGCCTAC CATAACCTTG CCATGGCGAA AAAGATGCTG AAGGCACTTT AG
|
Protein sequence | MKTLLTMLML LLTAQAWAFD AKSQCVICHG DKGKMESLGA ASMYLDPAQV DREVGMDGAT CVDCHLGDPS QPSKEASHKD MLRPFVVGVG PKVKGQAVSR ADAGALKPIV PDGDGMDRML PEGDPKKLEG LGVKILTGIE WHDRDPETLA YAPKVAEQTC GRCHAKEVKD YNSSAKGLLK HQRAYRKWAE TLPGPQNCGM WFGQNYENLK SQTSVPFSAA QNAATDRSCN TCHPGCNDCH YKPFTGKGRH SYGKPDTDSC YGGGRASICH AGPMDRRRGA GYVRGEYAFP SNLPRGAHVK AGVQCLDCHK PVNHQFGHLA ADDARGACAK CHADIVKAVQ TSAHKKVDCG ACHITVSGAY QYTFWGKGNF AGVETPYGKH KEYYGIRDLP TLIKNASGRW IPVKPYPMAV LNQTMEVGPT GLLFRSIPKR SVPGNVRIGE PPAFEVSRAA TDVNDAFIIV GTRNDLPSGN KAILWVQMDK LSHALGKPRG CATCHDSHAQ VGKSEWSYFE SKDVTKRFKG SYTVTADKNG IRFSDAVWET PIMAANRKVE DIAPFAVLPK DAWDVKRINL SIPFDEKKTG KERGELDKFL AELGKRKGGD ELRKIRVIAY HNLAMAKKML KAL
|
| |