Gene GM21_3063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3063 
Symbol 
ID8138409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3552238 
End bp3553704 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content64% 
IMG OID644870663 
ProductNHL repeat containing protein 
Protein accessionYP_003022849 
Protein GI253701660 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.440395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCAAGC TAATTAGAAT TGGGTGTCTG CTGTTGCTGG GGTCGGCCTT GTGCGGCTGT 
TACGAGATGC GCCCCACCTC CGGCGGGGGA AAGCTCACCG TGCTGCCTGA GGACGGGGTA
CGGGCGCTGA GCCCAGAAGA CATCGCCCTG CCTTCTGGGT ACCGCATAGA GGCAGTAGCC
ACCGGTCTCA CCTTTCCCAG CGGCGTCGCC TTCGACGAGA GAGGCACGCC GTTTGTGGTC
GAGTCGGGGT ACTCCTATGG CGAGGTCTGG ACGGTGCCCC GGCTGCTCAA GGTCGAAAAC
GGCAAGAAGG TTACCGTGGC GCAGGGTGGG AACAACGGCC CTTGGACCGG AGTTGCGAGG
CTTGGCGGCT CCTTTTACGT GGCTGAGGGG GGAGAGCTTG AGGGGGGGAG GATCTTGCGC
ATCGACGCCG ACGGCAGCAT CAATCCCATC GTGGATGGTC TCCCAAGCCG CGGCGATCAC
CACACCAACG GCCCCGCGGC GGGCCCTGAC GGCTCGCTCT ATTTCGGCAT CGGCACGGCC
ACGAACTCCG CTGTGGTGGG GGAGGACAAC CTCAAGTTCG GTTGGCTCAA GCGAGACCCC
GAGTTTCACG ATACCCCCTG CGCCGACGTT ACGCTTAGGG GGGAGAACTA CGCCAGCGAC
GATTTTCTGG AGGGAAAGGG CCTGGTGGCA ACAGGGGCCT TCATGCCCTT CGGGATCTAT
ACCCTCCCGG GAGAGGTGGT CTACGGAGAA GTCCCCTGTA ACGGCGCGGT GTTGAAGGTA
CCGCCGTCGG GAGGGAGGCC GCAACTGGTA GCCTGGGGCT TTCGCAACCC CTTCGGGCTC
GCCTTCTCGC CGCAGGGAAA GCTTTACGTC ACCGACAACG GTTACGACGA GCGGGGCTCC
CGCCCCGTGT TCGGCGCAAG CGACGTGCTC TGGGAAGTAA CGCCGGGAAC CTGGTACGGG
TGGCCAGATT TCAGTGCCGG GCTCCCGCTC GATTACGGAA ACCAGTTCAA GCCTCCGCTG
CGGCATAGGC CGAAGCCGTT GCTTGCCGAA CATCCCAACG TCCCGCCGGC GCCCGCCGCG
ATTCTCCCGG TTCATGCCTC GGCGAACGGA CTCGACTTTT CCCGCAGCGA GCGGTTCGGC
CACCGCGGCG AGGTGTTCGT CGCGCTGTTC GGGGACCAGT CGCCCGGCAC CGGCAAGGTG
ATGGGTCCGG TCGGTTTCAA GGTCGTTCGG GTGAACGTGG CTGACGGGGT GATCAGGGAT
TTCGCCGTCA ACAAGGGGGA GAAGAACGCG CCCGCCTCGG CATTTGGGAG CGGAGGGTTG
GAGCGGCCGC TTGCCGCGCG CTTCGATCCA TCTGGGGAGG CGCTCTACGT GGTGGATTTC
GGGATGCTCA AGGAAACGGT GAAGGGTAGC ATCCCCATGA AGAATACCGG CGTCTTGTGG
CGCATAACGA GGGAAGATCA AGAGTAG
 
Protein sequence
MCKLIRIGCL LLLGSALCGC YEMRPTSGGG KLTVLPEDGV RALSPEDIAL PSGYRIEAVA 
TGLTFPSGVA FDERGTPFVV ESGYSYGEVW TVPRLLKVEN GKKVTVAQGG NNGPWTGVAR
LGGSFYVAEG GELEGGRILR IDADGSINPI VDGLPSRGDH HTNGPAAGPD GSLYFGIGTA
TNSAVVGEDN LKFGWLKRDP EFHDTPCADV TLRGENYASD DFLEGKGLVA TGAFMPFGIY
TLPGEVVYGE VPCNGAVLKV PPSGGRPQLV AWGFRNPFGL AFSPQGKLYV TDNGYDERGS
RPVFGASDVL WEVTPGTWYG WPDFSAGLPL DYGNQFKPPL RHRPKPLLAE HPNVPPAPAA
ILPVHASANG LDFSRSERFG HRGEVFVALF GDQSPGTGKV MGPVGFKVVR VNVADGVIRD
FAVNKGEKNA PASAFGSGGL ERPLAARFDP SGEALYVVDF GMLKETVKGS IPMKNTGVLW
RITREDQE