Gene GM21_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2157 
Symbol 
ID8137493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2520342 
End bp2521421 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content61% 
IMG OID644869772 
ProductTPR repeat-containing protein 
Protein accessionYP_003021967 
Protein GI253700778 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones114 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTTCG GACTTTTCAA GAAGAAGGAT CATCGTTACT ACCAGGCCCA GGGTGTTAAG 
TTTCTGGCTG CGGAGCGCTA TGCCGACGCC CGGGTCGACT TTCTCGAAGC GCTGAGGCTT
TGCCCTGCTG ACGCCGTGAC CGACCAGGGC GAGATCCGCC AGGGGCTGGA TCGCTCGGGA
AACCGGCTGG GCGAACTCAA CCTGGAGGAG GGGGAACATT GCCTAAACCT GGGGGAGCTG
CAAAAAGCGT TCGACCACTT CACCCTCGCC GCCGAACTGG CAGCCGACCA GGGGATCAAG
GCCAAGGCCC AATCGGGGCT CGGCAGGGTG CAGCAGGGGA ACGCACCACC GGCTTCCCCT
GCCGCAGCCG CCGTTACTCC GGCTGCCGCC GCTCCCGCGA AGGAAGTTGC CGGGCCCTAC
AAGCCGCACG GCGGAGGCTC CTGCACCTCC TGCGGCACCC ACGCACCGAA AAAGCCTCTG
GAGGCGGAGC CCACCGGATT CGATCTCGCC GACGAGGACC AGTTTCACCT CATGGTGGCG
CCGCTTCCCG GCGACCTCCC CGTCCGTTAC GGCGCCATGG GAAGCAAATT CGCCCAAGCC
TATCTCATGA TACACGACGG AAAGGACGCT AATGCGCTCC CCGTTTTGCA AGAAATGCTG
TTATCTGGTG AAAATGACAT TGTATTATAC GAAGTGGCAC TTATAATGTT CAGGGCCGGG
CGCATTCATG AGAGCCAAGC GCTTCTGAAT CGCGCTCTTT CGGTCAACTC GGGAAACGGC
ATGGCTTACC TCGCGCTGGT GCAACTTTTG GCCGGCGGCG GCAGGTACGC CGAGGCAATC
GCCCTGGTTG AACGGATGCT GGCGGAAAAC GTGATGGCGG ACCAGGCGCA GTTCATCCTG
GGCGAGCTCT ACGAGACGAC GGGGGACGAG GCGAAGGCGA TCGAGATGTG GTCGAAGGCG
CTGGAGATAC CGACCGTGGC ACGCGCGGCC GCCGAGAAGC TGGTCCCGAT CCTGGGGAGC
CAGGGGCGTA CCGAAGAGGT CAAATATCTA GCCAAAAAGT ACTTAAAAGG ATGCTGCTAA
 
Protein sequence
MLFGLFKKKD HRYYQAQGVK FLAAERYADA RVDFLEALRL CPADAVTDQG EIRQGLDRSG 
NRLGELNLEE GEHCLNLGEL QKAFDHFTLA AELAADQGIK AKAQSGLGRV QQGNAPPASP
AAAAVTPAAA APAKEVAGPY KPHGGGSCTS CGTHAPKKPL EAEPTGFDLA DEDQFHLMVA
PLPGDLPVRY GAMGSKFAQA YLMIHDGKDA NALPVLQEML LSGENDIVLY EVALIMFRAG
RIHESQALLN RALSVNSGNG MAYLALVQLL AGGGRYAEAI ALVERMLAEN VMADQAQFIL
GELYETTGDE AKAIEMWSKA LEIPTVARAA AEKLVPILGS QGRTEEVKYL AKKYLKGCC