Gene GM21_0567 details


Gene Information

Locus tag: GM21_0567
Symbol:
ID: 8135882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 692232
End bp: 693662
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 60%
IMG OID: 644868184
Product: hypothetical protein
Protein accession: YP_003020399
Protein GI: 253699210
COG category:
COG ID:
TIGRFAM ID:
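
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(692232, 693662)
print(length, protein_length_aa(length))  # prints: 1431 476
```

Both results match the Gene Length and Protein Length fields reported for GM21_0567.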


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 6.15788e-22
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGAA CCCTGGCCCT CACCCTGATT CTCCTCAGTT CGCTCTTCAT CCCGCTCCTA 
GCCTCTGCCG AAAATATCCG CGCCTACATC TCCGACTTCA CCGTCTCCGG CGGCGACTCC
GCCGACCTTA AGACCACCTT GAAGCGCCTT ATGGCCTCGC GCCTTTCCGG AGACGGGCTA
ATCACCGTGG AGACCCCAGC CGAGGCGGAC GTTCTTATCT CCGGTAGCTA CACCATGCTC
GGCAAGATGT TCAGCCTCGA CGCCGCCGCC AAAAGCGCCC AAGGGAAGCA GCTTGCCGCC
GCCTACGAGC AGGGTGAAAG CATCGATGCC CTGATTCCCG CAGTCGGCAA AATTTCAAGC
AAACTCAAAG GCGAGATCGT GAAGCAGTAC CCGCAGGGAG CAGCTCCTGC TGCATCTTCA
CCTCAACCCT CCTCGGCTTT GATCAAGGCA CCCGCCTCCG AGTTGCTGCG TAACGAGGCC
GCTCCCGGCT GGACCAGCCA GCGCATCGCC GGGGCCAAGA GCGCGATCAC TTCTTGGGGC
ACCGACGAAA TCCTGGTCGC CGACCAGGAG GGGCTCTACC TCTACAAAAA AGACGGCAAA
CTGGCCCTGA TCGCGCAAGC AGCGTTAGCC CAGCACCAAA GGCTGCTCGC CGTCGACGCC
ATCGGCCCCG ACCAGGACGG CAAGGTGCTC GCTTTCGTCA CCATAATCGA CCGTGAGGCC
CCTTCATCCC GCGTTTACTC CATCCAGAAC GGCACGCTGA AGCCGGTGGC GCAGGACCTC
CCCTACATGT TCCGCGCGCT GGCGCCTTAC GGCGGCAAGA AGCAGCTCTT CGCCCAGGAG
ATGGGTCGCA CCGAAGATTA CTACGGCGAC GTTTACGAGG CGTCCTTCGC CGACGGGACG
GTGAAACTTG CAAATCCGAT CAAGATGCCG CGCTTCGCCA ACATCTTCAA CTTCAACCTG
TTCCGCGATC GGTCCGGGAA ATCCTACCTC ACCTGCTTCA ACGACAGCGG ATATCTGTTG
GTATATGCGG ACGGCGACGA AATCTGGAGA AGCAGCGATA AGTTCGGCGG CACCGAGACC
TATTTCCAGC GCCGCGACAT GGAAAACGAA AGGACCACGG GAACACCCTT CCGGACCAGG
TTCATCGACC AGCGCATCGC CGTTACCGAA AAGGGTGAGG TGATCGTCCC CCAAAACGCA
GGGTTCTTCG TGCTGGGCAA CGCCCGCTCC TACTCCAGAT ACTCCATGGT GGCCTTTGCC
TGGAACGGAT CGTCGCTTGA GGAACTCTGG AGGACCAAGG CGAGCCAGAA CTATCTGGCT
GACTACGACT TCAACCAACA AAGGCGCGAG ATGGTGCTTT TGGAGGTGAC CCAGAAGAGC
GGCCTGGGCG GCAAGGGCGG CAGTCTGGTC AGGATTCTCC GGGCGGAATA G
 
Protein sequence
MKRTLALTLI LLSSLFIPLL ASAENIRAYI SDFTVSGGDS ADLKTTLKRL MASRLSGDGL 
ITVETPAEAD VLISGSYTML GKMFSLDAAA KSAQGKQLAA AYEQGESIDA LIPAVGKISS
KLKGEIVKQY PQGAAPAASS PQPSSALIKA PASELLRNEA APGWTSQRIA GAKSAITSWG
TDEILVADQE GLYLYKKDGK LALIAQAALA QHQRLLAVDA IGPDQDGKVL AFVTIIDREA
PSSRVYSIQN GTLKPVAQDL PYMFRALAPY GGKKQLFAQE MGRTEDYYGD VYEASFADGT
VKLANPIKMP RFANIFNFNL FRDRSGKSYL TCFNDSGYLL VYADGDEIWR SSDKFGGTET
YFQRRDMENE RTTGTPFRTR FIDQRIAVTE KGEVIVPQNA GFFVLGNARS YSRYSMVAFA
WNGSSLEELW RTKASQNYLA DYDFNQQRRE MVLLEVTQKS GLGGKGGSLV RILRAE
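
Both sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for a nucleotide sequence. The helper names are hypothetical, and the GC calculation simply counts G and C over all characters, which may differ from how the GC content field in this record was derived.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above, as laid out on the page.
first_line = "ATGAAAAGAA CCCTGGCCCT CACCCTGATT CTCCTCAGTT CGCTCTTCAT CCCGCTCCTA"
seq = join_blocks(first_line)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `join_blocks` should give a string whose length can be compared with the Gene Length field.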