Gene GM21_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1007 
Symbol 
ID8136329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1186285 
End bp1188285 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content55% 
IMG OID644868620 
Producthypothetical protein 
Protein accessionYP_003020828 
Protein GI253699639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones113 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTTT TAGCCTTTTG TAGCCACTAC CTGCCGCCGA TTGCGTGTGG TGGTACCCTC 
TGCTGGCTTG CCTGGAAATC TAACAAGGAA TTCATTCAAC CATCCAAGAT TCTTAAACGA
GAACTGACTG AAGCCATCAA AAACCTGAAG GAATTGCGAC ATGGTGCCAG TGGCCAGCCG
GTCACCGATC TTGCCGTTAT AGCGGATAAG GTCATGACAG GAGAAGTGCT TCGCCATCTC
TGGAGCGAGT ACCAAGAGAC GCTGCACCCG CAAAAAGGGA TAGATACCGA CGGTTTCGAG
CGCATTGTCC GCTACCGCTC GACCGCCCTC TCTGAGACTT TTTTCACCGA CCAGGCACTA
GTAGACACAC CGCTAAAGAC TGCTTTTTTC AAACATCTAC CGGGTATCCT GACTGGAATT
GGCATCATAG GGACGTTCTT TGGACTTATC ACTGGGCTAA AAAGCTTTGA GGTTTCCGCC
AACGCCGATC TTGTCAGAAA GAGCCTTAGC GGTCTTTTGT CCAGCGTAGG CGATGCCTTC
ACAGTTTCAC TGATGGCGAT CGCATCCGCC ATGATTTTCA CTTTTATCGA GAAAACAGCG
GTTACTCAAT GCTATGGACA CGTTGAGGAG CTTACCCAGC TCATCGACAG CTTGTTTGAC
GCGGGAGCCG GCGAGGAGTA CCTTTCCCGC TTGGTGGCGG CCTCCGAAAC ATCTGCCACG
CAAGCGACCC AACTGAAAGA CGCGCTAGTG TCAGAGTTCA AGCAGATCAT GGCTGAAGTC
ACCGAGTTGC AGGTCTCCGC AGCCGCTAAA CACAGCTCGG CCATGTCCTC CGCCATCGTC
GAGAGCTTCA CCGAAAGTAT ATCCGAGCCA ATGGAGAGGA TCTCGCGGGC GGTGGATAAC
GTCGGGACCA ACCAAGGGGA TGCGGTCAAC AGGCTTCTAA CCGACGTGCT TGCTAACTTC
AGTTCCCAGA TGGAGGGTAT TTTTGGGGGG CAACTGCGCG GGATAAACGA GCTCCTGATC
CAAACTACCG AGACGATGCA GGGTGTGTCG TCGAGATTCG AAACGGTCGC CGCTGGGGTG
CAAACTGCGG GGGAGAGCGC CGCAGACGCC ATGGCGGAAA AACTCTCCCA AGCCATAAGC
TCCATTGAGG CAAGACAAGA GATCATGAAT GCCCAGATGG GCGAATTTGT GGTGCAGATC
AAAAATCTTG TCCATGAGTC GCAAACCGAG ACTTCCCAGA AGATGCAGGG GATTCTGGCC
GATTTGGGAG AAAAGGTGTC CGGGATGGTG ACGCAGTTGG AGGAGCAGTC CCGCGAGAGT
ACCAAGTTGC ACCAGACCAG CCAGATCCAG TTTGCAGAGA CGACGAACGC AACGGTCGAC
GGGATCGGCG GCATGGTACA AGCACTCGCC GAGGAGGTGC AGTCAGCTAG CGATGCCATG
CGGCAGAGTG TGTCCAGCCT GTCCCAGTCA AGCCGGGAGT CGATTGACAA GCTCAACAGC
GGTGCCGAGG TACTCTATAT GGCCTCCAGC GAATTCGCTA AGGCAGGCAA AGGGGTAACG
GACACGGTCC GGGAAAGCGG CCGAGCGGTC GAGACGATTA CCGGGGCGAC AAACCTGCTA
GGGGCAATTG TGAAGGACGT CCGGAGCATC CTGAGTGAAA ACGAAAGGGC GAAGGATACT
TTCGGCGCGA TGGTGAACGA CCTCAGGAGC CTGGTGGAAA ACGCAAAACG CGAGGCCTCA
ATGTCTGAGG AGGTGATCGC CTCCATCAGG CACGCCGCCG AGCAGCTGAG TTCCGCAGGA
CAGCAGGCCG AAGATTACCT GCGGGGAGTC ACCGAAGTCT TGGGCAATGC CCACGTGGAG
TTCGCCCGCA ACATCGAGCG TACCCTGAAC CATGGCAATA CTACGTTCCA AAAGGAGCTC
TCCATGGGGG TTGGCTTGCT CAACATCGCA ATCAAAGAAC TGGGAGACAC CATCGACGAG
TTCCCCAGGG GTAACGCGTG A
 
Protein sequence
MDVLAFCSHY LPPIACGGTL CWLAWKSNKE FIQPSKILKR ELTEAIKNLK ELRHGASGQP 
VTDLAVIADK VMTGEVLRHL WSEYQETLHP QKGIDTDGFE RIVRYRSTAL SETFFTDQAL
VDTPLKTAFF KHLPGILTGI GIIGTFFGLI TGLKSFEVSA NADLVRKSLS GLLSSVGDAF
TVSLMAIASA MIFTFIEKTA VTQCYGHVEE LTQLIDSLFD AGAGEEYLSR LVAASETSAT
QATQLKDALV SEFKQIMAEV TELQVSAAAK HSSAMSSAIV ESFTESISEP MERISRAVDN
VGTNQGDAVN RLLTDVLANF SSQMEGIFGG QLRGINELLI QTTETMQGVS SRFETVAAGV
QTAGESAADA MAEKLSQAIS SIEARQEIMN AQMGEFVVQI KNLVHESQTE TSQKMQGILA
DLGEKVSGMV TQLEEQSRES TKLHQTSQIQ FAETTNATVD GIGGMVQALA EEVQSASDAM
RQSVSSLSQS SRESIDKLNS GAEVLYMASS EFAKAGKGVT DTVRESGRAV ETITGATNLL
GAIVKDVRSI LSENERAKDT FGAMVNDLRS LVENAKREAS MSEEVIASIR HAAEQLSSAG
QQAEDYLRGV TEVLGNAHVE FARNIERTLN HGNTTFQKEL SMGVGLLNIA IKELGDTIDE
FPRGNA