Gene GM21_3141 details


Gene Information

Locus tag: GM21_3141
Symbol:
ID: 8138492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3645061
End bp: 3646212
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 64%
IMG OID: 644870745
Product: hypothetical protein
Protein accession: YP_003022926
Protein GI: 253701737
COG category:
COG ID:
TIGRFAM ID:
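
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 3645061
end_bp = 3646212

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1152 383
```

Under these assumptions the result agrees with the Gene Length (1152 bp) and Protein Length (383 aa) fields.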


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 153
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACTT CCTACGTCAC CATCATGCTG GTGGTCTTTC TCGTGCTGAC CGGTCTCGCC 
ATAGACATCG GTTACATGTA TGTAAGCGAC GAGGACCTGC AGCATTCCGC GGAAATGGCG
GCCCTGACCG GAGCCGAGTC GCTCAAAAAG CGGCTGCTCT TGCAGGCGCA GCACTCCCCG
GGAAAGCTGG CGCAGGTCTT AGCGGACCCT CTGCAGTCGG CGGCACGCAG CGTCGCCGTG
GACACCGCGA CCGGAAAACA CAGCGCCTCG GCACTCGTCG CCCTCATGAA CGACAACGGC
AACGCCCTCA CTGAAAACAA CGACATAACG GTCGGCTTTT GGAACATGAG TAGCCGCAGT
TACACCCCGG GGGGGACCCC GGTGAACGCG ATGCAGGTGC GGGCGCGGCG AACGGCGGAA
AGCAGCTCCG TGGGCCTTGG CAGCCTCGGC ACTTTCGTCG CCAAGATAAG CGGTACCGCC
TCCTTCGGCT CCACGCCGGT GGCGGTGGCG GCCCTGGTTC CCGGCACCCG CTCCAACATC
GCCATCTGCG CCGCCGCCTG CGAGCCCTCC TGCGGCTATC CCGATGTCTG CAGCATACCG
GAACGAAGGA TGAGCCACCT GCCCTGGGAT CCCCAGAGGG AAAATTCCAG CGCCAACCGC
TACCTCTACA CCTCGCTTTT GCACCCGGTC ACCATCACCA ACACCATGAG CGACCTCGTC
TGCCAGGAGA TGCCGGTGCA GGAAGTCTGC GGCCTCCCCA TCTTCACCGC CGCCATGAAG
ACAGACGCCA TCCTGCGCGA CCTTAAGGCG ATGATGTACG ACCCGAACGT GGACAGCTCC
AACAAGGAGT ACGACAACAA CGGGAAACTC GCGGGATGGT GGGTGGTGGT CCCCGCCACC
GACTGCGCCG GCTTCCAGGC GGGAGAGGCC TTCGAGCAGC ACACGGTGGT GAAGTACTCG
CTGGTGCGCA TCAGCAGGAT CTGCGCCGCT GGGGAGCCCG GCTGCGGCAA GGCCTCGGCC
AGCGCCGATC AGCCGGCAGT CGCCTGCGTC CCCGGCGGGG AAGGGCTTTA CATCGACCGC
ATCTCCTGCG TCGGCTGCGA CAACGCCTCG AAGAGGCAAT TCTTCGGGCT GCGCCCCGTC
CTGGTCAACT AG
 
Protein sequence
MDTSYVTIML VVFLVLTGLA IDIGYMYVSD EDLQHSAEMA ALTGAESLKK RLLLQAQHSP 
GKLAQVLADP LQSAARSVAV DTATGKHSAS ALVALMNDNG NALTENNDIT VGFWNMSSRS
YTPGGTPVNA MQVRARRTAE SSSVGLGSLG TFVAKISGTA SFGSTPVAVA ALVPGTRSNI
AICAAACEPS CGYPDVCSIP ERRMSHLPWD PQRENSSANR YLYTSLLHPV TITNTMSDLV
CQEMPVQEVC GLPIFTAAMK TDAILRDLKA MMYDPNVDSS NKEYDNNGKL AGWWVVVPAT
DCAGFQAGEA FEQHTVVKYS LVRISRICAA GEPGCGKASA SADQPAVACV PGGEGLYIDR
ISCVGCDNAS KRQFFGLRPV LVN
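
The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a sketch only: it builds the standard codon assignments by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tooling. For brevity it uses only the first line of each sequence; the full sequences can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and of the protein sequence above.
first_line_dna = "ATGGATACTT CCTACGTCAC CATCATGCTG GTGGTCTTTC TCGTGCTGAC CGGTCTCGCC"
first_line_protein = "MDTSYVTIML VVFLVLTGLA"

print(translate(first_line_dna) == clean(first_line_protein))  # prints: True
```

Calling `gc_fraction` on the complete gene sequence gives the value to compare against the GC content field of the record.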