Gene GM21_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2000 
Symbol 
ID8137334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2318202 
End bp2319719 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content66% 
IMG OID644869613 
Producthypothetical protein 
Protein accessionYP_003021810 
Protein GI253700621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.94208e-35 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAAAGT CACTGCTTAT TCTTTTCCTT GCGCTCATGG TCCCGCTGAC CTTGAGCCAA 
GGCACTTCAG CCTGGGCCGA GAGCTACCAG CAGGAGTATT ATCAGACGGG CGCGGACCTA
CTGATGCCGG ACGAACTCGA CGAACTGCTG GCCCCCATCG CCCTCTACCC CGACCCGCTG
ATCGCGCAGA TCCTGCCGGC TGCGACCTTC GTGGACCAGA TAGACGACGC TGCGCGCTTC
GTCAGGCGTT ACGGCCCCAG CCGGGTCGAC TACCAGAGGT GGGATTTAAG CGTCAGGGCC
ATCGCCCACT ATCCGCAGGT CATCTACATG ATGGATCGGA ACCTCGATTG GACGGCGTCG
CTCGGGCAGG CCTTCATCGA ACAGCCCCAG GAGGTCATGG ACGCCATACA GAGGCTGCGC
GACGACGCCC GGGCGGCCGG CAATCTCTAC AGTACGTCTC AGCAGTTGGT GATCGTCGAG
GCGGGGATCA TCAGGATCGT TCCGGCCAGA CCGCAGTACG TCTACCTGCC GGTTTACGAC
CCTTACGTGG TTTACTACGA GCCCTACTCC CCCTCCTACC CCTTCATCAC CTTCAGCGTG
GGATTCACCA TCGGGGCCTG GCTCAACCGC GACTGCGACT GGCGACGGCA TCGGGTGTAC
TATCACGGCT GGCGCGGCAC CGGATGGGTA AGCAGGTCGC GCCCGCACAT CCACGACCGG
CGCGGGATCT ACATCAATCA GCGCGCCTCC AGCATCACCG TCAATACCCG GGTAGCCCAG
CGAGACACCC GGGTCTACCG TCGGGAACTG CGCAACGAGG GGCTGCGCTG GCGCGGCAGG
ACAGGCGGGC GCAGCGAACC CGCCCGCGTG CAGACGCGCC GCGGGACCGC GACGCCGGGC
GCGAGGATCG AACAGCCGCG CCCGGGTCGA GTAGAACAGC GGCGCAGGGA GCAGACGCCT
CAGCCTGGTC CCGGTCGAGT AGAACCGCGA CGTCAGATAC AAGCTCCGCA GCTGACTCCG
GGCCGGGTCA AACAGCAACG CCCCGGCAGG ACCGAGAGCG CACCTTCGGG ACGGACACAG
CCACGCCCGG GGATAAGGGA GGGGGCGCCG TCCGGCGAGA ACCAGCAGCA CCGCCCTTCG
AGGATAGAGC AGCATAAGGC CCCGGCCGTG CAGCAGTCAC CTGCGGCAAC AGGCCGGCAG
CCCGCCGGCA CGGAGCAAAA GGTGCGGCGC CGGCAACTAA GACAGATAAA CCGGCCGCCC
GCTACCCCTG CCACCGAGGT CCCGCGCGCG ACCGTCCCGG CGCCGAACAC AGCTACCCCG
CCGACGAGGG TTACCCCACC CCCGGCCGCC AAACCTCCGG TCGTCCCGGC GCCGGCATCC
CCGGCTGCGC CCAGGGAAAT AGAGACGCCT CGCCCCTCAA GGCCGGAACG CGAGCAAGGA
GAGGGTTCAG GAAGAGGCGG CCGTGGCGGA GGTGAAAGGG CGGAACCAAG AGGAGGCGGT
CAGAGAGAAG GAAGGTAG
 
Protein sequence
MKKSLLILFL ALMVPLTLSQ GTSAWAESYQ QEYYQTGADL LMPDELDELL APIALYPDPL 
IAQILPAATF VDQIDDAARF VRRYGPSRVD YQRWDLSVRA IAHYPQVIYM MDRNLDWTAS
LGQAFIEQPQ EVMDAIQRLR DDARAAGNLY STSQQLVIVE AGIIRIVPAR PQYVYLPVYD
PYVVYYEPYS PSYPFITFSV GFTIGAWLNR DCDWRRHRVY YHGWRGTGWV SRSRPHIHDR
RGIYINQRAS SITVNTRVAQ RDTRVYRREL RNEGLRWRGR TGGRSEPARV QTRRGTATPG
ARIEQPRPGR VEQRRREQTP QPGPGRVEPR RQIQAPQLTP GRVKQQRPGR TESAPSGRTQ
PRPGIREGAP SGENQQHRPS RIEQHKAPAV QQSPAATGRQ PAGTEQKVRR RQLRQINRPP
ATPATEVPRA TVPAPNTATP PTRVTPPPAA KPPVVPAPAS PAAPREIETP RPSRPEREQG
EGSGRGGRGG GERAEPRGGG QREGR