Gene GM21_0982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0982 
Symbol 
ID8136303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1160705 
End bp1161895 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID644868596 
Producthypothetical protein 
Protein accessionYP_003020805 
Protein GI253699616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.364369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCC TCGCCAACAC AGTAAGCGTC TGCCATTTCA AGGTCCAGGG AGAACTCCCG 
AACCAGGACC TCTACACCTG GATCACCAAA CAACTGGCGG CCAACCGCTT CAACCCGATC
GACCAGGGAA GCGAAGAAAT GTCGATCGGC TGGGTCCACC TGGACGACCC GAAAGCGTCC
GACTTCGAGA CCCCCGCGGC CTGCTGCCGC GAACACTACC TCATGTTCAC CCTGCGCCGC
GACAAGCGCT CGGTCCCCTC GGCCATCCTG AAGGCACACC TGGAGAAGGC GCAGGACGAG
TTCCTCGCCG AGAACCCCGG CTTCGTCAAG GTACCCAAAC AGAAGCGGGA GGACCTGAAG
GAAGCGGTGC AGGCGATGCT CCTGTCGCAG ACGCTCCCGA CACCCGCCAC CTACGACGCG
GTCTGGGACA CCAGAAGCGG CATCCTCACC TTCACCTCGC TTTCTCCCAA GGTCATCGAA
CTCTTCGAGG AGCAGTTCAA GAAGACCTTC GAAGGGCTCC GCATCTCCGC GTTCCACCCC
TACGCCCGCG CCGAAAACGT GCTGGACGAG GGGAACCAGG TGCTCCTCAA AAAGGCCAAC
AAGGCGGGCG GCGACAACTA CCTGGAGCTG ATCAAGGAGA ACCAGTGGCT GGGCACCGAC
TTCATGCTCT GGCTCATGTA CCAGACCATG AACGAGGCCT CCGAGTACAG CGTGAACCAG
GAAGGGATCC TGCTGGCCAA GGAGCCGTTC GTGGCCTACC TGGACGACCG CGTGGTGCTT
CTGGGCTCCG GCGAGAACGG CGCCCAGAAG ATCACCGTGG CCGGGCCGCA GGACCACTTC
AACGAGGTGA GAAGCGCGCT ATTGAACAAG AAGCAGATCA CCGAGGCGAC GCTGCACCTT
GAGACCGGCG ACGACCACTG GAAGCTGACG CTCAAGGGCG AGCTCTTCCA CCTGGCGTCC
TTCAAGAGTC CGGCGGTAAA GCTGGAAAAA GACAGCAGCG TGGACGAGGC GATGGAGCGG
GAGGCGGTTT TCTTCGAGAG GATGATGCTA TTGGAGAAGG GGACCCAGCT TTTCGATTCG
GTGTTCGCCA CCTTCCTGAA ACTCAGGCTC GGCAGCGAGT GGGTCGAGCA GGAGCAGGCG
ATCCAGAAGT GGCTCAACGT CTGCAGCTTC TGCAACGGGT CGCTCGCGTA G
 
Protein sequence
MGILANTVSV CHFKVQGELP NQDLYTWITK QLAANRFNPI DQGSEEMSIG WVHLDDPKAS 
DFETPAACCR EHYLMFTLRR DKRSVPSAIL KAHLEKAQDE FLAENPGFVK VPKQKREDLK
EAVQAMLLSQ TLPTPATYDA VWDTRSGILT FTSLSPKVIE LFEEQFKKTF EGLRISAFHP
YARAENVLDE GNQVLLKKAN KAGGDNYLEL IKENQWLGTD FMLWLMYQTM NEASEYSVNQ
EGILLAKEPF VAYLDDRVVL LGSGENGAQK ITVAGPQDHF NEVRSALLNK KQITEATLHL
ETGDDHWKLT LKGELFHLAS FKSPAVKLEK DSSVDEAMER EAVFFERMML LEKGTQLFDS
VFATFLKLRL GSEWVEQEQA IQKWLNVCSF CNGSLA