Gene GM21_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1194 
Symbol 
ID8136519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1388929 
End bp1390404 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID644868808 
Productcytochrome c family protein 
Protein accessionYP_003021013 
Protein GI253699824 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones124 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA AAACGATTAA ATTGCTGGTA GCGGCCGCGG CCATGCTCGT GAGCGCTTCA 
CTCGCCTTCG CGACACCGCC TCCGGCTCCC CAGACCGTAG GGATCAAGGA CACGGTCTTC
AAGAACTTCT CTGGTTCGAA CTACAAGCTC TGCCGCGACT GCCACACCCC CGGCTGGGTC
ACCGCGACCG ACAGCGACCT GGTCCTGAAG GACAAGCACC ATGCGTTGAT CAACCAGCCC
GGCGGCGTTG TGGTCAGCTG CAACAACGCC TCCGGCACCC TTCCTGCCAA TCTGGCTACC
GGATGCCATT ACATCACCAC CGATCCGGCC ACCGGCGTCA CCGCAGTCCA GGATCCAAGG
CCCTGCTTCA ACTGCCATAC CAAAGGTCCG CACCACTTGA CCGACCAGGC GGCAGCGCAA
AACTGCAAAT ACTGCCACGG CTCTGCCATC GACAACCCGG GTGACGGCCA CTGGATCCCG
ACCAGCACCG ACTACGCGAT GGATACGACC TTCAACGGCA TGACCCCTGC TCCGGTAGGC
CGCAGTGTCG TGGATCCCGC CGACCCGACC AAGACCATAA TCGTTCAGGG TTGCGAGGCC
TGCCACCAGG CCGACACCAC CCTTCAGATA TTCGCCAACA AAGACACCCA CCACAGCACC
GGTATCGGCC AAGACCTCAG CCCGGTCGGT AACTGCACCT GGTGCCATGC CGCGACCGGC
AGCGAAAACA ACTTCACCAT CCGCGCCTGC GAGGCCTGCC ACGGCATCGC CTCTTTGCAC
AACATCCAGG CCGACTCCCC GAATGCCGCA AACCTTGGGA CCATCGTTGC CAGCAACGAG
GAGCCGGGCT TTGGTCACGT CGGTAACAAC TGGGATTGCG TGGGCTGCCA CTACTCCTGG
ACCGGCACCG CCGTAAGCGA TACCACCGCT ACCGCGCCGT TTGTAAACGA GATCAGCGCC
ATCACCCTGC CGGCAGGCGT CGCCAACACC CTTACCCTCA CCGGTATGGG CTTCACCAAC
CTGGATGCCA CCGGGAACAA CTACATCCCG ACCGTGGTCC TGACCCGCGG AACTGAAACC
TTCAACCTGA TTCCGTTCTC CACCTCGGTG AGCGAAATCA AGGTTGCTCT CCCCACGACC
CTGGTTGCTG GCGTGTACGA AGTCCGCGTC AACAAGGGCG GCGAGACGGT CAGTAACCTG
AAGAGCCTTA CCCTCACGCC GAGACTTGCC GCCACCAACG CGCTCTTGAC CTCCACCACC
CTTACCATCA CCGGTACCGG GTTCAGCACC GCTCCGGCCA ATGAGTACCA GGGCCTTATG
GGTGTCTTCG TCGACGGCGT CCAGGCTCGG GTCATTTCCT GGAGCAACAC CAAGATCGTC
GCCACCGGCA CCAACTTCGC CGCCGGCAAA CTTGCCGTCG TGAAGTCCGT CTACGGGGAC
GTGACCCGTC CCATCACGGT ACCGATCAAG AAGTAA
 
Protein sequence
MEKKTIKLLV AAAAMLVSAS LAFATPPPAP QTVGIKDTVF KNFSGSNYKL CRDCHTPGWV 
TATDSDLVLK DKHHALINQP GGVVVSCNNA SGTLPANLAT GCHYITTDPA TGVTAVQDPR
PCFNCHTKGP HHLTDQAAAQ NCKYCHGSAI DNPGDGHWIP TSTDYAMDTT FNGMTPAPVG
RSVVDPADPT KTIIVQGCEA CHQADTTLQI FANKDTHHST GIGQDLSPVG NCTWCHAATG
SENNFTIRAC EACHGIASLH NIQADSPNAA NLGTIVASNE EPGFGHVGNN WDCVGCHYSW
TGTAVSDTTA TAPFVNEISA ITLPAGVANT LTLTGMGFTN LDATGNNYIP TVVLTRGTET
FNLIPFSTSV SEIKVALPTT LVAGVYEVRV NKGGETVSNL KSLTLTPRLA ATNALLTSTT
LTITGTGFST APANEYQGLM GVFVDGVQAR VISWSNTKIV ATGTNFAAGK LAVVKSVYGD
VTRPITVPIK K