Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3542 |
Symbol | |
ID | 8138914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4094925 |
End bp | 4099286 |
Gene Length | 4362 bp |
Protein Length | 1453 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871161 |
Product | cytochrome C family protein |
Protein accession | YP_003023321 |
Protein GI | 253702132 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 139 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGCT ACCGCAGACC TGGATATGGC AATCTCCGAA CGATCCTTAC TTTGGTGCTG TTCGGCGTTG TGATGTTCAC TGCATCGCTG GCGCTGGCCG CGCCCCAGTA CGACCTTAAG TGCGACGATT GCCATACCAT GCCTCCGCTC GACTCTCTGA CGGGGCAGCG TGAGCCGGAC TTCGGTTCGA TCCGCGGCAA TCACCAGACC CATGCGGGGA ACACGGCTGA CAGCTGCGCC AAGTGTCACG GGCCGCAGGT GACCAGCACC GGGCATCGCG ACGGCGTCAT TCAGGTGCAG CAGGACCTGA ACGGCAGCAC CGGAGCGAGC TACAGCAGGG GGTTCTTCAA CCAAACTTCC GTCCCCCCGG CAACCCTTGG GACCTGCTCC AACGTCAACT GCCACTTCGA GACGGTGACC CCCGAATGGG GCACGCCTGC TCTGGGCGAG CCTTTCGGCA CCTCCTGCGG GACCTGCCAC CAGGCCATTC CCGACAGTTA CGCACATACT AAGCACCTGG CGTTGTACGG CAGCGACCTC ACCGTCTGCG CCAAGTGCCA TATCGATCAC ACCCAGGAAG CGAAGCCGTT CCAGCACGCC ACCAGCGCAG GGCGGAACAT AATCGTGACC GTCGGCTCCT ATGCCGGGTC CAACTTCGCT TACCTCCCCA GCCAGTCGGC CGCCAGGGTA GTAGGCAGCT GCTCCACCCT GTACTGCCAC AGCTCCGGGC AAAGCGCCAC AGGCGGCATG CTCGGCGCCG GCGACTACAG CGCACCTCTT TGGAGCGATG CGGCCAGCGG CGCCTGCAAC ACCTGCCACA AGACCTCCGC GCTTTCCTCC GGCAGCCACA GCCGCCACTT GGCCTCCAAC GGCAACTGCG GCAACTGCCA CGTCGACGCG ACGGCCACCG GCTACACTGC GCCGACCCAC GTGGACGGCA GCATCGACGT CGCCTTGGGC GGGTACGACA AGGCGGGTGC CCCTGGCAAC GGCTACGGCA GCTGCTCCAA CGCGACCTGC CACGTAAGCC CCTACGGTGC GCAAGCCACC CCCTCCCCGC AGTGGGGCCA GAGCGCTGGG TGCGGCGCCT GCCACAACAA CGCCGGCGCC TTCGCCGCCA ACGGCGCGCC CGACACCGGC AGCCATCAAA AGCACATGGC GCTCACGGGC TCGGCCTGCA ACCAGTGCCA TGACGGCGCG ATCAAAGACA CAAGCGGCGG GCTGAGCCAC ACAAACGGCA CCGTCGAGAT AGCGAACGAC TACCTCGGTT CGCCGGTACA AAAGCACGCG CCGGGGACCT ACACCGGAAG CTGCGCCTCC GCCTCCTGCC ACGCCGACCC GTACGGCCCG CTGCCGATAC AGTCGCCGGT TTGGGGCGAC AACTCCGGCT GCGCGGCATG TCACCGCGGC GCCGGCATCT TCACCGCCAC CGGCGCCCCC GCGACCGGCA GCCACGACAA GCATATGGCC CTCCCCGGGA CCGGCTGCGA CGCGTGCCAC TCCGGCGCGG TCAGCGGCGT CTCCGGCGGC CTGACCCACG CCGACACCAA GGTTGACGTC GTCAGCTACT CGCTCTCCCC GGTAACAAAG CACCCGGTCG GGACCTACGG CGGCACCTGC TCCAACTCCT GCCATTCAAA CGGCAACGGG GAGCAGACCC CGACCCCGGT ATGGGGCGGG AACATGCCGG AAAACTGTAG CGGTTGCCAC GGCGGCGCGG CAGGCCTCGG CACCGTCGCC ATCTCCACCG GCAAGCACGC AGCCCACATG AACAACTACA CGACCATGGG GCGCGGCAAC AACCTCATGT GCGCCGAGTG CCACGCGAAA ACCGTCTCGC TCGCCAGCAA CACCGCGATC ACCGGCACCG TCCACTCGAA CAGCTTCAAG GACTACTCCG GCGTCAAGGC GGGGGGGAGC GCCAATTACG AAACCGGGAG CGGCATCTGC TCCAACGTCT ACTGCCACAG CTCCGGCCAG GCCACCCCGG TCTTCGCCAG CATGACCGGC TCCAAGACCT GGCGCGAAAC CGGCAAGCTC TCCTGCAACG GCTGCCACGG CAAGAGCGCC GGCGCAACCT GGGCGAGCGC CGCGGGCGCT CCGAACTACG CCAACAAGTA CGACGGCACG CTGAAGACCG CCAACAGCCA TGAAAAGCAC ACCGCTTCCA TGACCGACTC GACCGGTTGC GCCAACTGCC ACAACTTAAC CGTCGACAGC GGCGTCGCCG CCAAGATGCG CGACTACTCC TCCAGCCACT TGAACCGCGT CCGCGAGGTC GAGTTCAAGG TGGGTGGAAC CTACACCGAA GCGGACAAGA GCTGCGCCAC CTACTGCCAC AGCAACATCC AGGGGGGAAG CGGCAACGGC CAAGCCACCG TCTTCGGAAA CCCCGCCTGG GGCTCCAACG GCACGCTTAC CTGCGGCAGC TGCCACAGCA ACATGGCGGC CCTGCAGGAG ACCGCGCAAG ACCTGACCCT CGGGAGCCAC AAGCGGCACA CGGTGGACAG CATCTACAGC TGCTCCGTCT GCCACGGCGC CGGCAACACG GCAAGCAGCA CCGCCCCCTC GCACGCCGAC GGGGCCATCA ACCTGGCCTT CACCGAGAAG GGCGCCGGCA CCACCTACTC GCAGCCGCTG GAGAACACCC CGGGCGAAGG GTACGGCACC TGCTCGACCA GCTCCTGCCA CGGCCGCGCA ACCCGCAACT GGGGGGTCTC CACCACACTT GCCACCTGCG AGAAGTGCCA CGGCTCGGCC GATACGGCGC TGACCAGCGC GAGCTTCAAG GACACCGCGG GGAGCCCCAC TGGCCCCTAC GTCGGGACCC ACGTCTCGCA CCTGGCGGGT ACGCACAACT ACTCCAACCC GATCACCTGC GACCAGTGCC ATGCGGTTCC GTCCACGGTC GACGCCGCGG GTCACATGGA CGGCCTGCCG GCCACCCTCA CCTGGGGCTC CACCGCCACC CACCCGAGCT ACGCCGGCGG CGTGGAAGGC GCGCCCATGG TACCCGAGTA CGTGCCGGCC TCGCGCGTCT GCAACAACAC CTACTGCCAC GCCGGCGTCA GGAAGACCTC CGACGGCACA GCCCAGGGGA CCGGCGCCAG CCCCTCCTGG AACGACCCGG CTTACTTAGG CGGCACCGGC TGCAACAAGT GCCACATGAA CCCGCCGGCG TATCCGCACT CGGCCAGCGC GAACTGCAGC GCCTGCCACA ACCACGTGGA TCAGTCCAAC GTCGCCTTCG TCGATAAGAC GAAGCATCTT AACGGCATCG TAGAGGTCAC CGTGGACGAC TGCCTCGGCT GCCACTCCTC GACCAACCTC TGCGCACCGG GCGACCCGGC GTGCATCAAC AAGGAGCTGA TCGGGGCGCA TAATTTCCAC ACCGACGCCG AGCTGTTCCT GGCGGGCAAG AAGCTCTCCA GCGGCGATTT CATCGACACC TCCTGGATCT ACGCGATCGA GTACAAGGAC GGCTTCCCGA AATACGGCTG CGGTTTCTGC CACCCAATGG ACTCCGGGAC CCACAAGAAC GCCATCGTGG AACTGGACCT CGATCCGAGC CACAGCCTGG CCGGCACCGT CAAGACCAAG AACAAGGCGG GCGGCCCGTG GGTGACCCAA ACCATGGGAA GCAGCGTGGT CTGCTCCAAC GTCTACTGCC ACTCCAACGG CTACGTCTCC GACACTACGC AGCAGTACCA GTTCCAGACC ACCCCGGACT GGTACGCGGT CAACCCGTGG GGCGGCGTCG ACAAGTGCGC CCAGTGCCAC GGCAACTCTC CCAACACCGC CGGCAAGGAA GGTTCGGCGG CTCACGCACG GCACGTCGTC GGCAACCACT ACGCCGACAT CTTCGACGGC TACAGCGCAA GGATCGCAGT CGCAGGCGCG CCGGGCTCAG GTGCCGCGCA CGGCGACCCG GCCACCTCGA CCACCTTCAA CTGCAACCTG TGCCACTACT CGACGGTCGC GGTCTCCTAC AACGACAAGG GGAGCGTCTG CTCCAGCTGC CACGTGGCCG CAGGCTCCGC GCCGCTCAAG GGTAACCTGC AGGTCGTAGC CACCAGCCAG AGCCACATAA ACGGCGACGT CGACGTGGCA TTCATGACTC CGTTCAACGT GAAGTCCAAG GCACAGCTCA GGATGGGCGT CGATGCAGTC CAGTCGATCT ACACCAGCTG GACCCGCGTG AACGGTTACA AGACCGCCAC TTCCTACGAT CTGGCCCGGA GCACTCCGAG CTACGTCGGC GGTACCTGCT CCACCGTCGC CTGCCACAAC GGCACGCAGA TGGAGTGGCG CACCAAAGGA CCGCTGGCCT GTGCAGCCTG CCACACCGGG CTGCCGCAAT AG
|
Protein sequence | MKSYRRPGYG NLRTILTLVL FGVVMFTASL ALAAPQYDLK CDDCHTMPPL DSLTGQREPD FGSIRGNHQT HAGNTADSCA KCHGPQVTST GHRDGVIQVQ QDLNGSTGAS YSRGFFNQTS VPPATLGTCS NVNCHFETVT PEWGTPALGE PFGTSCGTCH QAIPDSYAHT KHLALYGSDL TVCAKCHIDH TQEAKPFQHA TSAGRNIIVT VGSYAGSNFA YLPSQSAARV VGSCSTLYCH SSGQSATGGM LGAGDYSAPL WSDAASGACN TCHKTSALSS GSHSRHLASN GNCGNCHVDA TATGYTAPTH VDGSIDVALG GYDKAGAPGN GYGSCSNATC HVSPYGAQAT PSPQWGQSAG CGACHNNAGA FAANGAPDTG SHQKHMALTG SACNQCHDGA IKDTSGGLSH TNGTVEIAND YLGSPVQKHA PGTYTGSCAS ASCHADPYGP LPIQSPVWGD NSGCAACHRG AGIFTATGAP ATGSHDKHMA LPGTGCDACH SGAVSGVSGG LTHADTKVDV VSYSLSPVTK HPVGTYGGTC SNSCHSNGNG EQTPTPVWGG NMPENCSGCH GGAAGLGTVA ISTGKHAAHM NNYTTMGRGN NLMCAECHAK TVSLASNTAI TGTVHSNSFK DYSGVKAGGS ANYETGSGIC SNVYCHSSGQ ATPVFASMTG SKTWRETGKL SCNGCHGKSA GATWASAAGA PNYANKYDGT LKTANSHEKH TASMTDSTGC ANCHNLTVDS GVAAKMRDYS SSHLNRVREV EFKVGGTYTE ADKSCATYCH SNIQGGSGNG QATVFGNPAW GSNGTLTCGS CHSNMAALQE TAQDLTLGSH KRHTVDSIYS CSVCHGAGNT ASSTAPSHAD GAINLAFTEK GAGTTYSQPL ENTPGEGYGT CSTSSCHGRA TRNWGVSTTL ATCEKCHGSA DTALTSASFK DTAGSPTGPY VGTHVSHLAG THNYSNPITC DQCHAVPSTV DAAGHMDGLP ATLTWGSTAT HPSYAGGVEG APMVPEYVPA SRVCNNTYCH AGVRKTSDGT AQGTGASPSW NDPAYLGGTG CNKCHMNPPA YPHSASANCS ACHNHVDQSN VAFVDKTKHL NGIVEVTVDD CLGCHSSTNL CAPGDPACIN KELIGAHNFH TDAELFLAGK KLSSGDFIDT SWIYAIEYKD GFPKYGCGFC HPMDSGTHKN AIVELDLDPS HSLAGTVKTK NKAGGPWVTQ TMGSSVVCSN VYCHSNGYVS DTTQQYQFQT TPDWYAVNPW GGVDKCAQCH GNSPNTAGKE GSAAHARHVV GNHYADIFDG YSARIAVAGA PGSGAAHGDP ATSTTFNCNL CHYSTVAVSY NDKGSVCSSC HVAAGSAPLK GNLQVVATSQ SHINGDVDVA FMTPFNVKSK AQLRMGVDAV QSIYTSWTRV NGYKTATSYD LARSTPSYVG GTCSTVACHN GTQMEWRTKG PLACAACHTG LPQ
|
| |