Gene GM21_1000 details

Gene Information

Locus tag: GM21_1000
Symbol:
ID: 8136322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1180315
End bp: 1181607
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID: 644868615
Product: hypothetical protein
Protein accession: YP_003020823
Protein GI: 253699634
COG category:
COG ID:
TIGRFAM ID:
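
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon (both assumptions agree with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS that includes its stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_length // 3 - 1


length = gene_length_bp(1180315, 1181607)
print(length, "bp")                      # 1293 bp
print(protein_length_aa(length), "aa")   # 430 aa
```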


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 115
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATCGA ACATTATGAT GCCCACACCG AACATGGCTG CCTCTTCTGA GGCCTCGGAT 
GAACTGGTTA AACGGTTGGC AGAACTGAGC GAGGTGCAGT ATGACCAGGT GCGTAAAGAG
CAGGCGAAGG TACTTGGGAT TAGGCCGTCG ACACTTGATG CTGCAATCAA AAAGGCGAGA
AAGCCTTCTG ATACTGCGGC AATCCTCTTC GAGGAAGTCG AGCCTTCTCC CATGCCGGTC
GATCCTGGCA TCCTGCTGAG CGACCTAAGA GACACTATTC GCCGCTTCGT TGTCTGCACT
GAGCAGGCGG CTATAGCGGG TGCACTCTGG ATAGCCATGA CTTGGTTCAT AGATGTTGTC
TCGGTCGCAC CTCTGGCTGT AATTACCGCG CCCGAGAAAC GCTGTGGCAA GAGCATTTTG
CTTGGATTAT TTGAGAAACT GACCATGAAA CCCTTAGCGG CAAGCAATAT CACGCCGGCA
GCCTTCTTCC GTGCAATTGA TGCATGGTCT CCGACCTTGT TGATCGATGA AGCTGACGCC
TTTATGAAGG ACAACGAGGA ACTACGGGGG CTGCTCAACA GCGGACATAC GAGGGCCTCT
GCCTACGTGA TCCGCACGGT CGGTGACAAC TTCACGCCAA CTAAGTTTAA TACCTGGGGA
GCCAAAGCCC TGGCAGGCAT CGGAAACCTA CCTGGTACCG TGATGGACCG CTCCATTGTC
TTAGAACTGC GGCGCAAGCT CGCTACTGAG ACAGTTGAGC GCCTTCGGTA TGCTGATGAG
CAAGTATTTA AGGATCTCAA GGGGAGGCTG GCGAGGGTTA GGGAGGACTA CATGGAGACG
TTACACAAGG TTCGCCCACC CTTGCCGGCG CAACTCAACG ACAGAGCGAT GGACAACTGG
GAGCCACTGC TGCAGATAGC AATGCTAGGT GGCGAAGCCT GGTTTGAAAC TGCCACTAAG
GCGGCAATTA AGATAGCAGC CAAGGAAACT GGGACTATAA CGACTGGTAC CCAGTTGCTG
GTAGACATAA GGGATATCCT CAACTCGAGA AGCAGCGACC GAATCAGCTC CCAGGACCTG
ATCGACGCAT TATGCCTGGA CGCCGAGTTG CTGTGGTCCA CCTACAACAG GGGAAGGGCG
ATCTCTACTC GTCAGCTTGC GATGTTGCTT AAACTGTTTG GCATCCATAG CAAGACAATC
CGTTACAACA ACAGCACGGC TAAAGGCTAT GAAGCGGAGC AATTCCTCGA TGCCTTTTTC
CGTTACATTC CCCCTGTCAC TCTAGAGCAG TAA
 
Protein sequence
MQSNIMMPTP NMAASSEASD ELVKRLAELS EVQYDQVRKE QAKVLGIRPS TLDAAIKKAR 
KPSDTAAILF EEVEPSPMPV DPGILLSDLR DTIRRFVVCT EQAAIAGALW IAMTWFIDVV
SVAPLAVITA PEKRCGKSIL LGLFEKLTMK PLAASNITPA AFFRAIDAWS PTLLIDEADA
FMKDNEELRG LLNSGHTRAS AYVIRTVGDN FTPTKFNTWG AKALAGIGNL PGTVMDRSIV
LELRRKLATE TVERLRYADE QVFKDLKGRL ARVREDYMET LHKVRPPLPA QLNDRAMDNW
EPLLQIAMLG GEAWFETATK AAIKIAAKET GTITTGTQLL VDIRDILNSR SSDRISSQDL
IDALCLDAEL LWSTYNRGRA ISTRQLAMLL KLFGIHSKTI RYNNSTAKGY EAEQFLDAFF
RYIPPVTLEQ
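
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the code below translates a space-grouped nucleotide block like the one shown above. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not a library API.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon terminates translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGCAATCGA ACATTATGAT GCCCACACCG AACATGGCTG CCTCTTCTGA GGCCTCGGAT"
print(translate(first_line))  # MQSNIMMPTPNMAASSEASD
```

The output matches the first two ten-residue groups of the protein sequence listed above.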