Gene GM21_0495 details

Gene Information

Locus tag: GM21_0495
Symbol:
ID: 8135805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 610535
End bp: 611653
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 65%
IMG OID: 644868114
Product: protein of unknown function DUF34
Protein accession: YP_003020333
Protein GI: 253699144
COG category: [S] Function unknown
COG ID: [COG0327] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family
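
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 610535
end_bp = 611653
gene_length_bp = 1119
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons = span // 3
expected_protein = codons - 1

lengths_consistent = (
    span == gene_length_bp
    and span % 3 == 0
    and expected_protein == protein_length_aa
)
print(span, expected_protein, lengths_consistent)  # 1119 372 True
```

Under these assumptions the reported gene length (1119 bp) and protein length (372 aa) agree with the coordinates.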


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.0420348
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACTC CAAGAGTTTC AGATATATCC GGAATCACTG GCAAAATTGC CCCGACCCAC 
CTCGCCGAGT CCTGGGATAA CGTGGGGCTG CAGCTTGGGG ATCCCTCCTG CCAGGTCTCG
CGGATCATGG TCGCGCTGGA TCCCGGCCGT CCCGCCATCG AGACGGCCGT CGAGGCCGGT
TGCCGGCTTC TGATAACGCA CCACCCCTTC ATCTTCACCC CCCTCAAGAA GATCTCCACC
GCTGATGAAA CCGGACGCCT CGCCATACTC GCCCTGAAAA ACGATCTCTC CATCATCTCG
CTGCACACAA ACTTCGATAT AGCCCCAGGC GGCGTGAACG ATCTATTGGC CGGGCTGCTC
GGCGTCCAGG AGGCGCAGCC GCTCAGGATC ACCGGCGGCG ACGAGTACGT GAAGATGGTC
CTTTTCGCGC CGCGCGGCTG CGAAGAGAAG CTTTTAGGTG CGCTTTCCCC CTTCATGCCT
CACATCGGCA ACTACCGCGA TTGCTCCTAC CAGGGGGAGG GGACCGGGAG GTTCACGCCG
CTTCCGGGGG CGCGTCCGTT CGTCGGAGCG GTTGGGGCGA GCCATGCCGA GCCCGAGAGC
AGGCTGGAGC TCTTGCTGGT CAAGGAACGT ATCGCCGCCG CGGTCGCGGC GCTCAAGGGG
GCGCATCCCT ACGAGGAGCC TGCCTACGAT CTTTACCCGG TGCTGAACCG TGGCGAGGCG
TACGGGCTCG GCAGAATCGG AAAGCTGGCG GAGCCGGTGA GCGCCGGCGC CTATGCGCTG
CTGGTCAAGG AACGGTTGGC GGCGACCGGG GTGCGCCTGG TGGGCGACCC GGCGCGGCAG
GTGAAGAAGG TGGCCCTTTG CGGCGGCTCC GGCGCGTCGC TCATCCACGA GGCGCAGCGC
AAGGGGGCCG ATCTTTTGGT CACAGCGGAT GTGAAGTACC ACGAGGCGCG CGAGGCCGAA
GCGCTGGGCC TGGCGCTTCT TGACGCCGGG CATTTCTCGA CCGAGTACCC CATGGTTCGT
GGGTTGGCCG GGCAGCTCAG AGCCGCCCTT AAGGCAAAGC GGTTCGAGGC GGAGGTTTTG
GAGTACCAAG GAGAGCGCGA GCCATTCAGT TTTTGGTAG
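
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before it can be measured. The sketch below shows one way to strip the grouping and compute the length and GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input. Run on the full sequence, the result should be comparable to the gene length and GC content reported in the table, although how the database rounds its GC figure is not stated.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGATCACTC CAAGAGTTTC AGATATATCC GGAATCACTG GCAAAATTGC CCCGACCCAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```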
 
Protein sequence
MITPRVSDIS GITGKIAPTH LAESWDNVGL QLGDPSCQVS RIMVALDPGR PAIETAVEAG 
CRLLITHHPF IFTPLKKIST ADETGRLAIL ALKNDLSIIS LHTNFDIAPG GVNDLLAGLL
GVQEAQPLRI TGGDEYVKMV LFAPRGCEEK LLGALSPFMP HIGNYRDCSY QGEGTGRFTP
LPGARPFVGA VGASHAEPES RLELLLVKER IAAAVAALKG AHPYEEPAYD LYPVLNRGEA
YGLGRIGKLA EPVSAGAYAL LVKERLAATG VRLVGDPARQ VKKVALCGGS GASLIHEAQR
KGADLLVTAD VKYHEAREAE ALGLALLDAG HFSTEYPMVR GLAGQLRAAL KAKRFEAEVL
EYQGEREPFS FW
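
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a minimal, dependency-free illustration of that relationship, not the method used by the database. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it permits. The function name `translate` is hypothetical, and alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string up to the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGATCACTC CAAGAGTTTC AGATATATCC"))  # MITPRVSDIS
print(translate("TTTTGGTAG"))                         # FW
```

The two outputs match the first ten and last two residues of the protein sequence listed above.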