Gene GM21_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0053 
Symbol 
ID8135352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp64464 
End bp66569 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content63% 
IMG OID644867670 
Productprotein of unknown function DUF323 
Protein accessionYP_003019898 
Protein GI253698709 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000000000456443 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCTGA GAAAAACACA TTCGGTAATA CTGACAGAAG GGGACCCCGA ACAGAAAAGG 
GCCGAGATAC TTCACTACTT TCATGCCACC TTCACCATCG ACGAGCGCCT CTATGAGACC
CTCAAGGACG AAACGTCCTT TTACCTTAAG GGGGACCGGC TCCGGCATCC GCTCGTCTTC
TACTTCGGCC ATACGGCCGC CTTCTTCGTC AACAAGCTCA TCATCGCCCG GGTCATAGAC
CGGCGCGTGA ATCCCCGTTT CGAATCGCTC TTCGCCGTAG GCGTCGACGA GATGTCCTGG
GACGACCTGA ACGATTCGCA CTACGACTGG CCCACCCCCG GCGAGGTGAA GCGTTTCCGG
GACCAGGTGC GCGACCTGGT GGACGGCCTG ATCAAGACGC TGCCGCTTAC CCTTCCCATC
ACCTGGGAGC ACCCCTTCTG GGCAATCATG ATGGGGATCG AGCACGAACG GATCCACCTG
GAGACCTCGT CCGTGCTGAT CCGCCAGCTT CCCCTGGACC GGGTGCGGCG CCACGACTTC
TGGGAGATCT GCCGGGAATC CGGCGAGCCC GCGGCGAACG AACTGCTGCC GGTTTCCGCC
GGCACGGTGA GGCTCGGGAA AGAGGAGGGG CATCCCCTCT ACGGTTGGGA CAACGAGTAC
GGAAGGCGCG AGGCGAGGGT TGAGGCGTTC TCCGCCTCGA AGTTCCTGGT CTCGAACCGC
GAGTACCTGG CCTTCGTGGA GGCTGGGGGG TACCTGGAGC GGGGGTGGTG GACGGAGGAG
GGGTGGAGCT GGCGCAGCTT CAAGGAGGCG GAGCATCCTC TTTTCTGGGT GGAACGCGAA
GGGGGGTGGG GGCTCAGAAC CATGCTGGAG GTCGTCGACC TTCCGTGGGA CTGGCCGGTC
GAGGTGAATT ACCTGGAGGC GAAGGCGTTC TGCAACTGGC TCTCCGCTAA AAGCGGCAAA
TCGATCAGGC TCCCGACGGA GGATGAGTGG TACCGCCTGT GCGAGCTGGC GGGGGTGCCG
GACCAGCCCG GGTGGAACCG GGCGCCGGGG AACATAAACC TCGAATACTG GGCTTCCTCC
TGCCCGGTGG ACCGCTTCGC CTTCGGCGAT TTCTTCGACC TGGTCGGGAA CGTTTGGCAG
TGGACGGAGA CCCCCATCTA TCCCTTCCAC GGCTTCCGGA TTCACCCCTG GTACGACGAT
TTCTCGACCC CCACCTTCGA CACGCGGCAC AACCTGATCA AGGGGGGCTC CTGGATCTCC
ACCGGCAACG AGGCGACCCG CGATTCACGC TACGCCTTCA GGAGGCATTT CTTCCAGCAC
GCCGGTTTTC GCTACGTCGA AAGCGCGCAC CCTGTCGAGA TCCACGAGGA CCCGTACGAG
ACGGACGCGC TTGCCGCGCA GTACTGCGAC GCCCACTACG GCCCCGAGCA TTTCGGGGTC
CCCAACTTCC CCAGGGCCTG CGCGGAGATC TGCCTGGAGC TGACCCGCGG GAGCTCCCGG
GGGCATGCAC TCGACCTGGG GTGCGCCGTC GGGCGCGCGA GCTTCGAGCT GGCGCGCGGC
TTCGACCAGG TGACGGGGCT CGATTTTTCG AGCCGCTTCT TCCGCCTGGC CGCGAGGATG
CAGGAGGAGG GGGGTCTGCG CTATGCCCTG CCGGAGGAGG GGGAGGTGGT TTCCTACCAC
GAGCTGGAGC TTGAGGATCT GGGGCTCAAG GAGGTCAGGG AGCGGGTGCA ATTCTTCCAG
GCGGACGCCT GCAACCTCCC CGACAAGTTC ACCGGCTACG ACCTGGTGCT GGCCGCGAAC
CTGATCGACC GGCTCTATTC GCCCCGCCGC TTCCTGAAGG CGATACGCGA AAGGCTAAAC
TCCGGCGGGC TCCTGGTGAT CGCGTCCCCC TACACCTGGC TTGAGGAATA CACGAAAAAG
GAAGAATGGC TGGGAGGTTA CCGGGAAGCG GGAGAACCGG TGTGGACTAT CGATGGACTA
TCGCGGGAAC TTTTGCCATA CTTCACGCCG CTCGGTGCTC CGCGCGAGAT CCCGTTCGTG
ATCAGGGAGA CGCGCCGCAA GTTCCAGTAC AGCATCGCGC AATTGACGGT ATGGGAGCTT
AAATGA
 
Protein sequence
MDLRKTHSVI LTEGDPEQKR AEILHYFHAT FTIDERLYET LKDETSFYLK GDRLRHPLVF 
YFGHTAAFFV NKLIIARVID RRVNPRFESL FAVGVDEMSW DDLNDSHYDW PTPGEVKRFR
DQVRDLVDGL IKTLPLTLPI TWEHPFWAIM MGIEHERIHL ETSSVLIRQL PLDRVRRHDF
WEICRESGEP AANELLPVSA GTVRLGKEEG HPLYGWDNEY GRREARVEAF SASKFLVSNR
EYLAFVEAGG YLERGWWTEE GWSWRSFKEA EHPLFWVERE GGWGLRTMLE VVDLPWDWPV
EVNYLEAKAF CNWLSAKSGK SIRLPTEDEW YRLCELAGVP DQPGWNRAPG NINLEYWASS
CPVDRFAFGD FFDLVGNVWQ WTETPIYPFH GFRIHPWYDD FSTPTFDTRH NLIKGGSWIS
TGNEATRDSR YAFRRHFFQH AGFRYVESAH PVEIHEDPYE TDALAAQYCD AHYGPEHFGV
PNFPRACAEI CLELTRGSSR GHALDLGCAV GRASFELARG FDQVTGLDFS SRFFRLAARM
QEEGGLRYAL PEEGEVVSYH ELELEDLGLK EVRERVQFFQ ADACNLPDKF TGYDLVLAAN
LIDRLYSPRR FLKAIRERLN SGGLLVIASP YTWLEEYTKK EEWLGGYREA GEPVWTIDGL
SRELLPYFTP LGAPREIPFV IRETRRKFQY SIAQLTVWEL K