Gene GM21_3543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3543 
Symbol 
ID8138915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4099593 
End bp4102466 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content66% 
IMG OID644871162 
Productcytochrome C family protein 
Protein accessionYP_003023322 
Protein GI253702133 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones142 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTTTA GGTATGGAGG ACGATGTTTG TTTGTGCTGG CACTTCTGGT GCTGACACTG 
ATGCAGCTTA CCGCGGGACA GGCACAAGCC GCCTCGCAGT ACAGCTATGA CTGTACCTTC
TGTCACCAGA TGCCGCCGCT GGATTCGGCT AACGCGAAGA AGGACCCCAA CACGGGCGCC
ATTCCCGGCA ACCACCAGGG CCACGCTTCG GCAGCGGTCA ACTCCTGCGC CAAGTGCCAC
GGCGACGCGA GCGGCTACGC CATGGGGCAC AGGAACAAGA CCATCGAGCT CGCCGACGGC
ATCGGCTACT CCAGGAAGAT CGCGGCCGGG TTCGTGAACC AGACCTCGGT TCCGCCGAAC
CCGATGGGAA CCTGCTCCAC CGCGGCCTGC CACAGTAACG GCAAGGGCGT GTTGAAGACG
ACCCCGGCCT GGGGCGCCGC ACCGTTCCAG GCTCCCGGCG ACTGCTCCCA GTGCCACGAC
GTCGCTCCCG CCTCCGGCAA CCACCCGACG CTGGGCAGCA AGCACGCCGC CTACTTCGGC
ACCGGTACCG GCTCCTGCGT CAAGTGCCAT ACCGACCACA CCGCGCAGGC CAAGCCGTTC
AGCCACGCGA CCAGCGCCGG CCGCGCCATC GAGGTGACCT TCGCCGACGG CGGCTCCTTC
GCGGCCAACC AGTGCTCCAA CGTCTCCTGC CACAGCAACG GCCAGGGGAG CTTCACCCCG
CCGACCTGGG GCGCCACCCT TGACTGCGCC GGCTGCCACG GCACCGCGAC CAGCGACACC
CTCTCCGGCA AGCACGGCAA ACACGTGAAC AACGCCGGCT TCCTCGGCAC CAACTACGGC
TGCGTGGAGT GCCACTCCGC CACCGTGATC GACAACAGCA CCATCGGCAA CTTCGGGAAC
CACGTCAACA AGACCGCCGA CGTCGCGGGC GCCCACGTAG GGACCCCGGT CGCCGGAAGC
TGCTCCACCT CTTACTGCCA CAGCGACGGC AAGGGGACCC AGAAGAGCGT GACCTGGACC
CAGACCGAAG TTCTTACCTG CAAAAGCTGC CACGGCTCCG ATGCGGCTCC CGCCTTCGCC
TCGGTCGCCG GCGAGCCTAA CTACGTGAAC GCCGGCCCCG ACCAGCCGCG CGCCAACAGC
CACCAAAACC ACGTCGTCGG GGCTGCGAAC TGCCAGAACT GCCATAGCAG CACCACCGTG
GACGGCCTCA CCATCAAGGC CGGCACTGCG CACACCGACG GCACCCGCAC CGTCGCAGCC
GGCAACGGCA AGAGCTTCGT CATCGCAGCG AACAGCTGCT CCGCGGTATC CTGCCACGAC
GGCGGCGGCA TCGTAGCAGG CGTCGGCGCT GCCAAATGGG GCCAGTCGCT TGGTTGCGCC
GGTTGCCACG GCGACGCGGC AACCCTTACC ACCAACGCGC ACGCGGCCCA CGTTTCCACC
AAGGGCTATG CCTGCGACAC CTGCCACGCC CAGACCGTTA CCGGCAGCAC CAACTTCGCG
AACAAGGCGC TGCACGGCGA CGCCATCGTA GAGGTTGCCG GCGTCAAGGT AACGAGCTTC
GCCACCGACA CCAAGAACTG CGCCACTTCC TGCCACCTTT CCGGCGCACC TAAGTGGAAC
GACACCGCTT CCGGCGCTTG CGGCACCTGC CATAACGCGC TTTCCACCAC CGTTAACGGC
CTTGTTTCCA GCAACGCCCA CTCCGCCCAC TTCACGGCTA CCTACGGCCC GGGCATGAAC
GGCGCGGCGG CCACCTCCTG CGCCGGCTGC CATACCCCCA ACACCGCGGC GAGCCACGCA
GACGGCACGC TCAACCTCGC CATCGGCTAC AACAAGATCG GCACCTGCTC CAGCTGCCAC
AAGCAGAACA CCAACTGGAC CACCGGGCGC GTTTCCTGCG AAAGCTGCCA CAGCACCGCA
GGGGGCGAGC TCTCCGTCAT CGGCGCCCTC GCCGCTCCGG ACAAGACCGG TGCTGCGACC
GCGGGTCACG GCAAGGCGGG CATCGACCAG TCCTGCTCCG CTTGCCACGA CGCCAACTCC
GCTCACATCA ACGGCGTAGC CGGCGACAAC AAGCGTCTGC TTGGTGCCTT GACCGGCGCC
GACAACCAGG AGTGTAACTA CTGCCATGCC GACGCCGGCA AGGTCGCCGG CGCCGCTCTC
AACGTGAAAG CCCACCGCCT CACCGGCGAA CTCGGCGCGA AGTGCTCCGA CTGCCACAAC
GCGCACGGTA CCGCCAACAG CATGATGGTT AACGGCACGA TCAACGGGAC CGAGGTGAGC
TTCACCACCG TCGACAGCTT CGCCAACGGA GCACGCACCG GCGTCTGCCA GGTCTGCCAC
ACCACGACCC AGTACTTCAC CAAGGCCGGC CAGCCGGAGG CGACCCACGT CGACTCCACC
GCGAACTGCC TCGAGTGCCA CCAGCACAAC CCGGCAACCG GCCTCGCCTT CGTTCCCAAC
GGCGGATGCG ACGCCTGCCA CGGCTACCCG CCGGCCCCGA GGCAGACCAT CTCCGCGGTC
ACCTTCGGCG TCATGGGCAA CTGGTCCACC GCCCGCTTCG AAGACTACTC CGGCGGCGGC
GGTGCCCACC TGGTCGCAGC GCACATCAAG AAAGACGCCA AGCCGTCCGA GGGTTGGGCA
AACTGCCTCC CCTGCCACAA CGGCGGCAAC GAAGCGCACG CAAGGGCTCT GCCGATCAGG
AACCACGTGG ACAGCGTCAC CGTCCAGATC GATCCGCAGT ACCGCTTCAG CGACGATCCG
TTCCTGGGCT ACACCTCCGC CCAGCTGGTC AACGGCGGGG TCAACAAGTC CGGAAGCTGC
TTCAACGTGA GCTGCCACTT CAAGAAGACC AAGCCGTGGA GTATCGAAAG GTAG
 
Protein sequence
MLFRYGGRCL FVLALLVLTL MQLTAGQAQA ASQYSYDCTF CHQMPPLDSA NAKKDPNTGA 
IPGNHQGHAS AAVNSCAKCH GDASGYAMGH RNKTIELADG IGYSRKIAAG FVNQTSVPPN
PMGTCSTAAC HSNGKGVLKT TPAWGAAPFQ APGDCSQCHD VAPASGNHPT LGSKHAAYFG
TGTGSCVKCH TDHTAQAKPF SHATSAGRAI EVTFADGGSF AANQCSNVSC HSNGQGSFTP
PTWGATLDCA GCHGTATSDT LSGKHGKHVN NAGFLGTNYG CVECHSATVI DNSTIGNFGN
HVNKTADVAG AHVGTPVAGS CSTSYCHSDG KGTQKSVTWT QTEVLTCKSC HGSDAAPAFA
SVAGEPNYVN AGPDQPRANS HQNHVVGAAN CQNCHSSTTV DGLTIKAGTA HTDGTRTVAA
GNGKSFVIAA NSCSAVSCHD GGGIVAGVGA AKWGQSLGCA GCHGDAATLT TNAHAAHVST
KGYACDTCHA QTVTGSTNFA NKALHGDAIV EVAGVKVTSF ATDTKNCATS CHLSGAPKWN
DTASGACGTC HNALSTTVNG LVSSNAHSAH FTATYGPGMN GAAATSCAGC HTPNTAASHA
DGTLNLAIGY NKIGTCSSCH KQNTNWTTGR VSCESCHSTA GGELSVIGAL AAPDKTGAAT
AGHGKAGIDQ SCSACHDANS AHINGVAGDN KRLLGALTGA DNQECNYCHA DAGKVAGAAL
NVKAHRLTGE LGAKCSDCHN AHGTANSMMV NGTINGTEVS FTTVDSFANG ARTGVCQVCH
TTTQYFTKAG QPEATHVDST ANCLECHQHN PATGLAFVPN GGCDACHGYP PAPRQTISAV
TFGVMGNWST ARFEDYSGGG GAHLVAAHIK KDAKPSEGWA NCLPCHNGGN EAHARALPIR
NHVDSVTVQI DPQYRFSDDP FLGYTSAQLV NGGVNKSGSC FNVSCHFKKT KPWSIER