Gene GM21_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0035 
Symbol 
ID8135334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp46821 
End bp48770 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content64% 
IMG OID644867652 
Producthypothetical protein 
Protein accessionYP_003019880 
Protein GI253698691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.638764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAGC GACTGCATCT CGAAAATTTC GGCAACGTCC ACGCCCTTCC CATCCTGCAC 
TACCGGATGG AGTTCGCGCA TCTGGTACGG GAAGCGTATG AGGTCCTGAA ACCGGACTGC
ATCGCCATCG AGCTCCCCCG GACCCTGGAG CCGCAGTTCC TGCGCGCCGT CGCGCGCCTC
CCCGAGCTTT CCGTCCTCGC CTACCACGTC GCCGGCCAAT CCGTCTTTCT CCTCGTCGAA
CCCGCGGACC CTCTCATCGA GGGGGCGCGG CTCGCGCTCA AGCACCGCAT CCCGCTGCAC
CTGGTCGACA TCGACCTGGA CAGCTACCCG TCCCACGACG AGCAGCTTCC CGACTCCTAC
GCGGTGCAGC GCATCGGGCT GGAACCGTTC TACCGGGAGG TGGAAAAGCT CTACCGCGAG
CTTGAGCCTT GCGACGAGGA TCTGCGCCGC GAGCGCGGCA TGGCCCACAG ACTGCAGCAG
CTCTCCGCGC AGCACCAAAG GGTCCTTTTC ATCTGCGGCA TGTCGCACCT GGAGCGGATC
CGGGAGAACT TCGGGAAACC TCTGGCGCAG CCTCTCACCC GCACCCACCG CGAAGGGGTC
GCCATCTTCA ACCTGCACCC GGAATGCTGC CACGAGGTCC TGGCCGAGTA CCCGTTTCTT
TCCTCCCTTT ACGAGACCAG GAGATCGCCG CTTCCTCCCG AACCCGCACA GGCGGCCTCC
TTGCGGAAAA GCTTCAACGC CTTCGAGCTG ATCCTGGGAG GGAAGCAGTC GATCCCCGAA
GAGCAGGCGC TCTTGGAGTC GATCCAGCGC AGCGCCCACC GCGTAGGGAG CGAAGGGGAG
ATGCCGGACC GCCAGAAGGT CATGCTGCGG CTCTTCCTGG AGGCGGCGCG CCACTATCGC
CAGGAGACCG GGGACAAGGT CCACTACTGG CAGAAGCGGG CCTTCTTCCG CTTCGTCAGG
AACTACGCGC TCCTGTCGCA GATGCTCCTT CCCGACCTGT ACCAGATGCT CGCCGCCGCG
CGCGGCTGCC TCGACGACAA CTTCGCCTAC GCCTTTTTCC GGCTTGCGGC GCACTACCCC
TGGCAGAGCG AGCAAAGCGA CATCCCCACC CTGAGGCTTT CCGCAGCCGA GCTCTCCGCG
GGGACCCGCA GGATACGCTT TCGCCCCCGG GAGCAGGTAC GCGGCAAGGG GCGCTCCGGC
ATCAAGATGA CGAACCGGCG CAAGGAGAAG CGCCCCGGCG ACTGGTTGGA GGGGTTCGAC
GACCCGTATA TCTGCTCCTA TCCCCCCGAG GACTTAAGCA TTGAGGAGTA CGGCCGCAAC
CTGAAACGGA TCGGTGCGCG GCAGTTGAGC GAGGAGGCGA GCAGGACCGA GCCTTTCTCG
GCGTCGCTTC TGGACGGGAT CGACATGCGC GAGACCATCA GGAACCTGCA CGAAGGGAAG
ATCTACGTAA AGGAAAACAA GCGGTTAAAG AGCGGGGTAG GGTGTGTGGT GGTGGTCTTC
GACGAGGACC GGGAGGACTC GGGGTATCCG TACTGCATGA CCTGGCTCGG CGAGCACGAC
CAGGAGTCGG ACATGGCATT TTACGCCACG CCTCCCACCG ACAACATCGT CGGCCCCGGC
ATTTCCCGCT GCGAGTACGG CGGATTCCTG TTAAGCTACC CGCCGCGCCG GATGCACGAC
GTCTGGCAGG ACCCGGATTA CCGCGGGGCC CTGGGGAAGG GGGAGGTGCT TTTGATGGCG
GCGCTGGATT ATTCGCTGGA GAAGGACGTG GTATATGCGG CGGCGAAGCC GCCGAGGAGC
TATCTGAAGC AGCAGGCGGC TCGGCTGGGA AAGAGGATCA TCTATCTGCC GCTGGGGAGC
CTCTCGCCGG TGGCGCTCAA GAGGCTCCGG GCGTTTCACA TTCTCTACGG CAAGGACAAG
CGGGACATCG CCAAGGAGTA TATCTGGTAA
 
Protein sequence
MPERLHLENF GNVHALPILH YRMEFAHLVR EAYEVLKPDC IAIELPRTLE PQFLRAVARL 
PELSVLAYHV AGQSVFLLVE PADPLIEGAR LALKHRIPLH LVDIDLDSYP SHDEQLPDSY
AVQRIGLEPF YREVEKLYRE LEPCDEDLRR ERGMAHRLQQ LSAQHQRVLF ICGMSHLERI
RENFGKPLAQ PLTRTHREGV AIFNLHPECC HEVLAEYPFL SSLYETRRSP LPPEPAQAAS
LRKSFNAFEL ILGGKQSIPE EQALLESIQR SAHRVGSEGE MPDRQKVMLR LFLEAARHYR
QETGDKVHYW QKRAFFRFVR NYALLSQMLL PDLYQMLAAA RGCLDDNFAY AFFRLAAHYP
WQSEQSDIPT LRLSAAELSA GTRRIRFRPR EQVRGKGRSG IKMTNRRKEK RPGDWLEGFD
DPYICSYPPE DLSIEEYGRN LKRIGARQLS EEASRTEPFS ASLLDGIDMR ETIRNLHEGK
IYVKENKRLK SGVGCVVVVF DEDREDSGYP YCMTWLGEHD QESDMAFYAT PPTDNIVGPG
ISRCEYGGFL LSYPPRRMHD VWQDPDYRGA LGKGEVLLMA ALDYSLEKDV VYAAAKPPRS
YLKQQAARLG KRIIYLPLGS LSPVALKRLR AFHILYGKDK RDIAKEYIW