Gene GM21_4138 details


Gene Information

Locus tag: GM21_4138
Symbol:
ID: 8139512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4730430
End bp: 4731542
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 63%
IMG OID: 644871753
Product: cobyrinic acid a,c-diamide synthase family protein
Protein accession: YP_003023911
Protein GI: 253702722
COG category: [R] General function prediction only
COG ID: [COG0857] BioD-like N-terminal domain of phosphotransacetylase
TIGRFAM ID:
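
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4730430, 4731542)
print(length)                     # 1113
print(protein_length_aa(length))  # 370
```

Under these assumptions the computed values agree with the listed gene length (1113 bp) and protein length (370 aa).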


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGGA AGATCTTCAT CGCGGCATCG GGGCAGAACA TAGGTAAAAC CACCATCAGC 
GTCTCGCTTT TGCACCTGGC CCAGAAGAAG TACGGGCGGG TCGGATTCAT GAAGCCTTTG
GGGCCAAAGC CTGCGCTTTT GCGCGGGGTG TCCGTCGACA AGGACGCCGC CCTCATCGCG
CAGGTGTTCG GGCTCGACAA GGACCTCCCC TACATGTCTC CGGTCGTGGT GCACCCCGAC
ACCTCACGCC AGGCCATTGA CGGCAAGATC CCCCTGGACG AGCTTCCCGA CCGGATCCTC
GCCAGCTACG CGGAACTGGA AAAGCACTGC GACTTCATCG TCATAGAAGG GTCGGGGCAT
CCCGGGGTAG GGTCCGTGCT GAACCTCTCC AACGCCCGCA TAGCGAAGAT GCTGAACGCG
CCGGTCCTGA TGGTGAGCGG CGGCGGGGTC GGCAACGTCA TCGACACCCT GGCCATGAAC
ACCGCACTCT TCAAACTTGA GGGGGCCGAG GTGCGCGGGG TGCTGGTGAA TAAGCTCTTT
TCAGAGAAAC GGGCGCAGAC CCTCGACTAC CTCACCCGCG CTTTCGCCGG AAAGCCCTTC
ACGGTGCTGG GGGGCTTCGA CTACAAGCCC GTCCTCGCCA ACCCGACGCT GAGCCGGGTG
GCGCGCCTTT TGGACCTGCC GCTGCACGGC AACCGCCGCG AGGTGCGGCG CATCATCCAT
CACGTGCAGA TCGGCGCCGC CTCCACCCAG CGCGTCACGG AAATGCTGCG CGACTCCTCG
CTGTTGCTCG TCACCAGCAG CCGAGACGAA CTCCTGGTCA CCTTGGCCAA CCTATACCAG
ATGCCGGAAT ACCGTTCCCG CATCGTGGGT CTGGTCATCC CCGGGATCTC CGACGTCAGC
GTCATCACCC AGCGCATCAT CGACCGCAGC AACATCCCCT ATTTCCGCAC CGAAAAGCTA
AGCACCGCCG AGCTTTACCG CCTCATCACC GACGACGTCT CCAAGATCAC CGCAAGGGAT
ACCGAGAAGC TGCGTCTCAT CAGGTCCCTG GCCGAGGAGC GGCTCGATTT CGACGCCATA
GACGAGGTAT TCGCCACGCC GCCTCGGCTT TGA
 
Protein sequence
MARKIFIAAS GQNIGKTTIS VSLLHLAQKK YGRVGFMKPL GPKPALLRGV SVDKDAALIA 
QVFGLDKDLP YMSPVVVHPD TSRQAIDGKI PLDELPDRIL ASYAELEKHC DFIVIEGSGH
PGVGSVLNLS NARIAKMLNA PVLMVSGGGV GNVIDTLAMN TALFKLEGAE VRGVLVNKLF
SEKRAQTLDY LTRAFAGKPF TVLGGFDYKP VLANPTLSRV ARLLDLPLHG NRREVRRIIH
HVQIGAASTQ RVTEMLRDSS LLLVTSSRDE LLVTLANLYQ MPEYRSRIVG LVIPGISDVS
VITQRIIDRS NIPYFRTEKL STAELYRLIT DDVSKITARD TEKLRLIRSL AEERLDFDAI
DEVFATPPRL
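
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a sketch only, not the method the database used: it assumes the sequences are pasted as 10-base blocks separated by whitespace, and it builds the codon table from the NCBI base ordering `TCAG`. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a single table serves here. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGGCCAGGA AGATCTTCAT CGCGGCATCG"
print(translate(first_blocks))  # MARKIFIAAS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the listed protein sequence and GC content.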