Gene GM21_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3643 
Symbol 
ID8139017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4220729 
End bp4221970 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content50% 
IMG OID644871264 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_003023422 
Protein GI253702233 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones166 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCT TCCAGTCTCG TCATCAGTCA CTGTTTTTAA TCTGTATTCT ACTGACCGGT 
TTGCTGACTG GCGGTTGCTC GACAATCAAG GGTATCTCAA TGATCCGGGG AGGTTCTCCC
CAGAGCACCT TGGCTGGCGA CGAATCTGTA AAAGCAGAGC AGATGGCCCA TCTTCTGACC
GTAAAGGTCA GGATCGATGA TGCACCAGAG GATCTGACTT TCATGGTGGA TACAGGTGCC
ATTACCGTCA TAGATGAACA GATCGCCAAG AGATTGAAGT TCAAGGACTC GGTAACCAAT
AAGGTTACAG ACTCAGCGGG GAACAAAAAA GACATCCGCC TGGTTCAGGT GAACAAAATA
AGCGTCGGTA AAGTCGCAGT TTCAGATTGC GCTGCCGCGG TTGTCGATAT GAAAAAATTC
AACCCCAAGA TAGACGGCCT GCTCGGCTCG AATTTCCTGA GGTTTTTCAC GGTTCAGCTG
GATTACCGAA ATCACCGTGT GTCGTTCCTG AGTAAGTCGG ACGGGCGCTC CCTTGAGGGG
GCGATGAAGT TGCCGATGTG GCAAAACATG AAGTTCGGAT TCGCACCTAC CATCAAATGC
GAAGTAGACG GCTCGGTAGC TCTCGACTGC ATGGTCGATA CCGGACACGA TGCGATCGCC
TCCTTTCCTC TCTCCATTCT CGACAAGCTC CCTCACTTCA AGACCGGAGA ATATATCAGC
TCCAACGGCT CAATGGGAGC AGGAATATTT GGCAAAGACA CCCAAAGTTA CCTGGTCAGA
ACGGATCGAA TAGCATCAGG TCCCATAACC ATAGAAAATG CGGCGATTGT CTCTAACCGG
TTTGAAGATG TCATGACCCT TGGAGCCGCC TATCTGAAGA ACTTCCTGGT GACCATCGAT
TATCCAGCCT CTTTGCTGTA CTTGAAGCAG TACGACGATC AGCATCTCGA AAAGGAGATG
ATGTCCTACG GATTCGCCGT CTCCTATGAG AAGGATAAAG CGATCGTGAG CGGCCTATGG
AGAGGGAGTG CCGCGGACAA AGCGGGAATA TCACTCGGTG ACGAGGTGAT TGCTTTGAAC
GGCCATGAGA CATCGGGGTT GTCTTTATTC GACATGATGC AACTTGTGAA ATCGAACGAA
ACCCTGAGCA TTTCCTATAT CAAAAGTTCC AACGGGACCA AGTCGGATTT AACCCTCCAT
AAAGGGGACT TGACGCTCCT TCTACCGCCG TCGCCCAACT GA
 
Protein sequence
MDVFQSRHQS LFLICILLTG LLTGGCSTIK GISMIRGGSP QSTLAGDESV KAEQMAHLLT 
VKVRIDDAPE DLTFMVDTGA ITVIDEQIAK RLKFKDSVTN KVTDSAGNKK DIRLVQVNKI
SVGKVAVSDC AAAVVDMKKF NPKIDGLLGS NFLRFFTVQL DYRNHRVSFL SKSDGRSLEG
AMKLPMWQNM KFGFAPTIKC EVDGSVALDC MVDTGHDAIA SFPLSILDKL PHFKTGEYIS
SNGSMGAGIF GKDTQSYLVR TDRIASGPIT IENAAIVSNR FEDVMTLGAA YLKNFLVTID
YPASLLYLKQ YDDQHLEKEM MSYGFAVSYE KDKAIVSGLW RGSAADKAGI SLGDEVIALN
GHETSGLSLF DMMQLVKSNE TLSISYIKSS NGTKSDLTLH KGDLTLLLPP SPN