Gene GM21_3064 details

Gene Information

Locus tag: GM21_3064
Symbol:
ID: 8138410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3553839
End bp: 3555047
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 60%
IMG OID: 644870664
Product: hypothetical protein
Protein accession: YP_003022850
Protein GI: 253701661
COG category:
COG ID:
TIGRFAM ID:
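
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 3553839
end_bp = 3555047
gene_length_bp = 1209
protein_length_aa = 402


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1209
print(expected_protein_length(gene_length_bp))  # 402
```

Under those assumptions, both computed values agree with the listed gene length and protein length.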


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.161492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTTTA ATCCTTTCAA AGAGCGCGGC ATCGCCGCCG ACAAGCAGCT GAGGAACTGG 
CAGGAGTTGA ACGTCAAACC TTACGACAAG AACGAGGCGC ATCCCTACAC GAAGGCCCGC
ATCATCCTCA TGAACGGCGT GGAGGTCGAG GGGGCCATCT TCTCGCACCA GTTCGCCCGA
AACTGCAACG ACCCGGAACT AAAGAAGCAG TTAGCCCTCA CCCGCAGGGT CGAACAGCAG
CAGCAAAAAA CCATCAACTG GCTTTCCCCG GGCGACGAGT CGCCGCTTGA GACCACCATC
GGCTACGAGC AGGTGGCGGT CGATCTCACC GCCTTTCTCG CCGCAAACGT CCCCGATCAG
TACGTGAAGC AGGTCTTCGA TTTCGGGCTT CTGGAGGACT TCGACCACCT GTACCGCTAT
GCCAACCTCC TGGAGATGAC GCAGGGGGTG ATGGCTGAGA AGCTGGTCGG GAAGCTGACC
GAGATCACCC CTGGGCGCCC CACCATCAAG GAGCACCGCC ACCCCTTCGA CGATGTCAGG
AAGCCGATGA ACCGTTTGGC CGCCGATCCG CTCACCAAGC TCTACACGCT CACCCTTTTG
GCAGGCGAGC AGCAGACCAT GAACTTCTAC ATGAACATCG GCAACACGCT GCAGGACCAG
GTCGGGCGGG GGTTGTACCA GGAGATCGCC ATGATCGAGG AGCAGCACGT CACCCAGTAC
GAGTCGCTGC TCGACCCGCA GACCCCGTGG ATCGAGAACG CGCTCCTGCA CGAGTACAAC
GAGTGCTGGC TCTACTGGTC CTTCCTGCAG GAAGAGACCG ACCGCCACGT GAAACCGATC
TGGGAACTGC ACCTGGGCAT GGAGCTCACC CACCTCCAGA ACTTGGGCAA TGTCGCCGGC
AAGATGGGGG TGAACGTGGA TCAGGTGCTG CCGCAGACCT TCCCCGCGCC GCTGCAGTTC
AAGTCCCAGG TGAACTACGT CAGGGAGATC CTGGCCACCC AGGTCGACTA CAACGCCTTC
GAGACCGAGA TCGGTCAGCC GGACCAACTC CCGGAAAACC CGCGCTACCT CGAGTGCCAG
GACCTGCTCA ACGCAAAGGG GGCCCCGAGC GAAGAGGTGA TCAAGATGAA CCGGCGCAAG
AATGGGCAGG ATTACCGCCT GGAGTTGGCG GGGCAGCATC CCGTAAAGGA ACTGCGTCAG
AAGAAATAG
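
A GC content figure such as the one in this record can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases; `gc_percent` is a hypothetical helper, and the record does not say how its value was computed or rounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("GTGAGCTTTA ATCCTTTCAA"))  # 35.0
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the value for the whole gene.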
 
Protein sequence
MSFNPFKERG IAADKQLRNW QELNVKPYDK NEAHPYTKAR IILMNGVEVE GAIFSHQFAR 
NCNDPELKKQ LALTRRVEQQ QQKTINWLSP GDESPLETTI GYEQVAVDLT AFLAANVPDQ
YVKQVFDFGL LEDFDHLYRY ANLLEMTQGV MAEKLVGKLT EITPGRPTIK EHRHPFDDVR
KPMNRLAADP LTKLYTLTLL AGEQQTMNFY MNIGNTLQDQ VGRGLYQEIA MIEEQHVTQY
ESLLDPQTPW IENALLHEYN ECWLYWSFLQ EETDRHVKPI WELHLGMELT HLQNLGNVAG
KMGVNVDQVL PQTFPAPLQF KSQVNYVREI LATQVDYNAF ETEIGQPDQL PENPRYLECQ
DLLNAKGAPS EEVIKMNRRK NGQDYRLELA GQHPVKELRQ KK
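
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as the GTG that opens this gene is read as methionine. The sketch below is a simplified illustration rather than a reference implementation: it assumes the sequence is already in the coding orientation, accepts only ATG, GTG and TTG as start codons (a deliberate simplification), and does not handle ambiguous bases. `translate_cds` is a hypothetical name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# assignments as the standard code and differs in its start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplification: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCTTTA ATCCTTTCAA AGAGCGCGGC"))  # MSFNPFKERG
```

The output matches the first ten residues of the protein sequence listed above.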