Gene GM21_3564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3564 
Symbol 
ID8138936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4137082 
End bp4139949 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content66% 
IMG OID644871183 
Productcytochrome C family protein 
Protein accessionYP_003023343 
Protein GI253702154 
COG category 
COG ID 
TIGRFAM ID[TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones126 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCTA AGTATGGAGG ACGATGCTTA TTTGTGCTGG CGCTTCTGGT GCTGACACTG 
ATGCAGCTTA CCGCGGGACA GGCACAAGCC GCCTCGCAGT ACAGCTATGA CTGTACCTTC
TGTCACCAGA TGCCGCCGCT GGATTCGGCT AACGCGAAGA AGGACCCCAA CACGGGCGCC
ATTCCCGGCA ACCACCAGGG CCACGCTTCG GCAGCTGTCA ACTCCTGCGC CAAGTGCCAC
GGCGACGCGA GCGGCTACGC CATGGGGCAC AGGAACAAGA CCATCGAGCT CGCCGACGGC
ATCGGCTACT CCAGGAAGAT CGCGGCCGGG TTCGTGAACC AGACCTCGGT TCCGCCGAAC
CCGATGGGAA CCTGCTCCAC CGCGGCCTGC CACAGTAACG GCAAGGGCGT GTTGAAGACG
ACCCCGGCCT GGGGCGCCGC ACCGTTCCAG GCTCCCGGCG ACTGCTCCCA GTGCCACGAC
GTCGCTCCCG CCTCCGGCAA CCACCCGACG CTGGGCAGCA AGCACGCCGC CTACTTCGGC
ACCGGTACCG GCTCCTGCGT CAAGTGCCAT ACCGACCACA CCGCGCAGGC CAAGCCGTTC
AGCCACGCGA CCAGCGCCGG CCGCGCCATC GAGGTGACCT TCGCCGACGG CGGCTCCTTC
GCGGCCAACC AGTGCTCCAA CGTCTCCTGC CACAGCAACG GCCAGGGAAG CTTCACCCCG
CCGACCTGGG GTGCCACCCT TGACTGCGCC GGCTGCCACG GCACCGCGAC CAGCGACACC
CTCTCCGGCA ACCACGCCAA ACACGTGAAC AACGCCGGCG TGCTCGGCAC CAGCTACGGC
TGCGTCGAGT GCCACAGCTC CACCGTGACC GACAACAGCA CCATCGGCAA CTTCGCGAAC
CACGTCAACA AGAGTAAGGA AGTGGCGGGC GCCCACGTAG GGACCCCGGT CGCCGGCTCC
TGCTCCACCT CCTACTGCCA CAGCGACGGC AAGGGGACCC AGAAGAGCGT GACCTGGACC
CAGACCGAAA CCCTGGACTG CAAAGGGTGC CACGGCTCCG CAGCACCGGC TTCCTTCGCC
TCCATCGCCG GCGAGCCTAA CTACGCAAAC GGCGGCGTCG ACCAGCCGCG CGCCAACAGC
CACGAAAACC ACGTCTTCGG GGCCGCCAGC TGCCAGAACT GCCATAGCAG CACCACCGTC
GACGGCCTCA CCATCAAGGC AGGCGCTCCG CACACCGACG GCACCCGCAA CGTGGCAGCC
GGCAACGGCA AGAGCTTCAC CATCGCAGCG AACTCCTGCT CCGCGGTATC CTGCCATGAC
GGCGGCGGCA TCTTAACAGG CGTCGGGGCT GTCAAATGGG GCGGGACCCT CGGCTGCGAC
GGCTGCCACG GCGACGTCGA AACGCTCGCC ACCAACGCCC ATACGGCGCA CGTTGCCACC
AAGGGCTATG CCTGCGACAC CTGCCACGAG CAGACCGTTT CGGGGAGCTA CAGCTTCGTC
AACAAGGCGC TGCACGGCGA CTCAATCGTC GAGGTTTCCG GCGCCAGGTT GAACAGCTTC
GCCACCGACA CCAAGAACTG CGCCACTTCC TGCCACCTTA CCGGCACCCC GAAGTGGACC
GAGACCGCAT CCGGCGCTTG CGGTACCTGC CATAAGGCGC TTTCCACCAC GGTGAACGGC
CTTGTTTCCA GCAACGCCCA CTCCGCCCAC TTCACGGCGA CCTACGGCCC GGGCATGAAC
GGCGCGGCGG CCACCTCCTG CGCCGGCTGC CATACCCCGA ACACCGCGGC AAGCCACGCC
GACGGCACGC TCAACCTCGC CATCGGCTAC AACAAGATCG GCACCTGCTC CAGCTGCCAT
AAGCAGAACA CCACCTGGAC CGGCGGGCGC GTTTCCTGCG AAAGCTGCCA CAGCACCGCA
GGGGGCGAGC TCTCCGTCAT CGGCGCCCTC ACCGCTCCGG ACAAGACCCT GGCTGCGACC
GCGGGTCACG GCAAGGCGGG CGTCGACCAG TCCTGCTCCG CTTGCCACGA CGCCAACTCC
GCCCACATCA ACGGCGTAGC CGGCGACAAC AAGCGTCTCC TTGGTGCCTT GACCGGCGCC
GACAACCAGG AGTGTAACTA CTGCCACACC GATCCGGCGA AGACCACCGG GTTCACGCTC
GGCGTGAAAG TCCACCAGGC TTCCGGCCTG GGCGCCAAGT GCGCCGACTG CCACAACGCT
CACGGTACCG CCAACAGCAT GATGGTTAAC GGCACGATCA ACGGGACCAA CGTCAGCTTC
ACCGGCAACA GCACCTTCGC CAACGGCGCC AACACCGGCG TCTGCCAGGT CTGCCACACG
GCGACCGATT ACTTCAAGAA AGACGGCACC GGCGCAACCC ACGTGGAGTC CACCACGAAC
TGCTTGAACT GCCACGCTCA CAACCCGTCC ACCGGCCTCG CCTTCATGGC CAACGGCGCT
TGCGACGCCT GCCACGGCTA CCCGCCGGCT CCGAGGCAGA CCATCTCCGC GGTCACCTTC
GGCGTCATGG GCAACTGGTC TTCCGCCCGC TTCGAGGACT ACTCCGGCGG CGGCGGTGCC
CACGTGGTCG CGGCGCACAT CAAGAAGGAC GCCAACCCCT CCGAGGGTTG GGCAAACTGC
ATCCCTTGCC ACTTCGAGGG TCAGGCCGGC CACAACAGGG CGCTTCCGGT CAGGAACTTC
GTCGAGAACG TGACCGTGAA ACTCGACCCG CAGTACCGCT TCAGCAACGA AGTCATGGCA
ACCTACACCT CCGCCCAGCT TGTTTCCGGC GGGGCCAACA AGTCCGGCAG CTGCTTCAAC
GTGAGCTGCC ACTTCACCAA GACCAGGCCG TGGAGTATCG AAAGGTAG
 
Protein sequence
MLPKYGGRCL FVLALLVLTL MQLTAGQAQA ASQYSYDCTF CHQMPPLDSA NAKKDPNTGA 
IPGNHQGHAS AAVNSCAKCH GDASGYAMGH RNKTIELADG IGYSRKIAAG FVNQTSVPPN
PMGTCSTAAC HSNGKGVLKT TPAWGAAPFQ APGDCSQCHD VAPASGNHPT LGSKHAAYFG
TGTGSCVKCH TDHTAQAKPF SHATSAGRAI EVTFADGGSF AANQCSNVSC HSNGQGSFTP
PTWGATLDCA GCHGTATSDT LSGNHAKHVN NAGVLGTSYG CVECHSSTVT DNSTIGNFAN
HVNKSKEVAG AHVGTPVAGS CSTSYCHSDG KGTQKSVTWT QTETLDCKGC HGSAAPASFA
SIAGEPNYAN GGVDQPRANS HENHVFGAAS CQNCHSSTTV DGLTIKAGAP HTDGTRNVAA
GNGKSFTIAA NSCSAVSCHD GGGILTGVGA VKWGGTLGCD GCHGDVETLA TNAHTAHVAT
KGYACDTCHE QTVSGSYSFV NKALHGDSIV EVSGARLNSF ATDTKNCATS CHLTGTPKWT
ETASGACGTC HKALSTTVNG LVSSNAHSAH FTATYGPGMN GAAATSCAGC HTPNTAASHA
DGTLNLAIGY NKIGTCSSCH KQNTTWTGGR VSCESCHSTA GGELSVIGAL TAPDKTLAAT
AGHGKAGVDQ SCSACHDANS AHINGVAGDN KRLLGALTGA DNQECNYCHT DPAKTTGFTL
GVKVHQASGL GAKCADCHNA HGTANSMMVN GTINGTNVSF TGNSTFANGA NTGVCQVCHT
ATDYFKKDGT GATHVESTTN CLNCHAHNPS TGLAFMANGA CDACHGYPPA PRQTISAVTF
GVMGNWSSAR FEDYSGGGGA HVVAAHIKKD ANPSEGWANC IPCHFEGQAG HNRALPVRNF
VENVTVKLDP QYRFSNEVMA TYTSAQLVSG GANKSGSCFN VSCHFTKTRP WSIER