Gene GM21_3162 details


Gene Information

Locus tag: GM21_3162
Symbol:
ID: 8138514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3672917
End bp: 3674104
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 59%
IMG OID: 644870767
Product: NHL repeat containing protein
Protein accession: YP_003022947
Protein GI: 253701758
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
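
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that agrees with the listed 1188 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3672917
end_bp = 3674104

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```

Under these assumptions the computed values agree with the listed Gene Length (1188 bp) and Protein Length (395 aa).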


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 119
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAGCA ACATCACCGC CAACTGCCGT CGTCATCTTG CCAAAAGCTC CATCGTTGTG 
ATTGCCTTGC TGCTCTCAGC CTGTGCTTCC CAAAAGAAGG CCTTCGCGCC GGTCTTTTTC
CCGCCCGAGC CGAGCCCCCC CAGAATCCAG TACCTGATGG GAATCTCGGT ATCGACCGAC
GTAGAGGAGG AGAAGAAAAA CGACTTCTCC CTGGTCGTCA CCGGCAAGGG AAGCTCCGAG
GCCAAGCGCA TGATCGCCAA GCCCTACGGC ATTACCAGCG CCAACGGCAA GATCTTCGTC
TGCGACATCG GTGTGGGCAA CCTCGTCGTC ATCGACCCCG CAAAAAAGAC CTTCGATTAC
CTGAAGGGGA ATAACGGTCT GGGCAAGCTG AAAAAGCCGG CAAACGTAGG CCTGGACCAA
TACGGGAACC TCTTCGTCGC CGACGCCGCC AGGAGAGAGA TCATGGTCTA CAACGCGGAG
GGGCAGTTCG TGCGCTCCTT CGGCAAAGAA GAGAACATGA AGCCCTCCGA CGTGGCGATC
GACGGCGACC TGGTTTACGT CCTCGACCTC CAGAAGACCA AGAACGAGAT CAAGGTGTTC
GATCGCGAGA GCGGCAAGCT GACTAACACT TTCGGCAAGC GCAGCGACAA CGCAGAGGGG
GTCAACATCC CCACCAACTT CACCATGGAC CACAAGGGGG ACATCTACGT CACCAACGCC
GGCAACGGCA AAGTGATGAA GTTCGACCGC GACGGGCATC TGCTTCTCAC CTTCGGGGAC
CTGGGGGACA TCTCCGGCAT GTTCTCGCGT CCCAAGGGGG TGGCAGTCGA CCGTGAAAAT
CGCATCTACG TCGTCGACGG CGGCAACCAG AACGTGCAGG TGTTCAACGA AAAGGGGCGC
ATTCTCACTT CCTTCGGCGA TCCGGGCTTG ATGCGCGGGT CCCTCAACCT GCCGGTATCG
GTGACGGTCA CCCAGGAAAA CCTGGATTAT TTCCAGAAAT TCGCCGCCCC GGGGTTCACC
GTAGAATCGG TCATCCTGGT CACCAACCAG TACGGCGAGG ATAAGATCTC CGTTTACGGC
ATGGGCAAGA TGGCGGGGCG CGACTACAGC GAACGCCCCC CTACCGCCAA AGCTACCGAG
GGCCCGAAGG CTAAGGAAAC CGGGACGCCG GAGACCCAGA CGAAGTAA
 
Protein sequence
MLSNITANCR RHLAKSSIVV IALLLSACAS QKKAFAPVFF PPEPSPPRIQ YLMGISVSTD 
VEEEKKNDFS LVVTGKGSSE AKRMIAKPYG ITSANGKIFV CDIGVGNLVV IDPAKKTFDY
LKGNNGLGKL KKPANVGLDQ YGNLFVADAA RREIMVYNAE GQFVRSFGKE ENMKPSDVAI
DGDLVYVLDL QKTKNEIKVF DRESGKLTNT FGKRSDNAEG VNIPTNFTMD HKGDIYVTNA
GNGKVMKFDR DGHLLLTFGD LGDISGMFSR PKGVAVDREN RIYVVDGGNQ NVQVFNEKGR
ILTSFGDPGL MRGSLNLPVS VTVTQENLDY FQKFAAPGFT VESVILVTNQ YGEDKISVYG
MGKMAGRDYS ERPPTAKATE GPKAKETGTP ETQTK
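
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 30 bases of the gene and compares the result with the first 10 residues of the protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The function name `translate` and the table layout are choices made for this example, not part of the record.

```python
# Codon table in TCAG order; amino acid assignments are the same for
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(b) for b in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)

# First three 10-base blocks of the gene sequence above.
gene_prefix = "ATGTTAAGCA ACATCACCGC CAACTGCCGT"
# First 10-residue block of the protein sequence above.
protein_prefix = "MLSNITANCR"

print(translate(gene_prefix) == protein_prefix)
```

The same function can be applied to the full gene sequence; the spaces between 10-base blocks are stripped before translation.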