Gene GM21_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3894 
Symbol 
ID8139268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4480909 
End bp4482291 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content67% 
IMG OID644871511 
Productcobyrinic acid a,c-diamide synthase 
Protein accessionYP_003023669 
Protein GI253702480 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1797] Cobyrinic acid a,c-diamide synthase 
TIGRFAM ID[TIGR00379] cobyrinic acid a,c-diamide synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0538501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGA TCGTGATAGC GGCGCCCTCC AGCGGCTGCG GGAAGACCAC CGTGACCTTG 
GGCGTCATGG CGGCCCTGAA AAGGCGCGGC CTCAAGGTGG CCCCCTTCAA GGTGGGACCG
GACTTCATCG ATCCCGGCTA TCACCGGGTG GCGACCGGGG TCCCCTCCGT CAACCTGGAC
GGCTGGATCT GCGATCCCCA TTTCCTGAGG GAGAGCTTCC TCCACCATGC GGCCGCTGCG
GACATCGCGG TCGTCGAGGG GGCGATGGGG CTCTTTGACG GGATCGACGG GCTCTCCGAA
TCGGGGAGCA GCGCGCAGGT CGCCAAGGAA CTCGCAGCCC CCGTGGTCCT CGTGGTGGAT
GCGCGCAGCC AGGCGAGAAG CGCCGCAGCG CTGGTGCACG GCTTCGCCGG CTTCGACCCG
GCGCTGCGGG TGGCAGGGGT CATCTTCAAC AACGTCGCCA GCGAGAACCA CGAGCGCATC
CTGCGGGAGG CGCTCGGCGC GGCGGTGCCG GGCGTGCAGG TGATCGGCTG CCTCCCCAGG
GACCCCGCCC TCGCCATCCC TTCGCGCCAT CTGGGACTGG TGACGGTGGA GGACAACCCG
CTCTCGGACC CCTTCCTGGA CCACCTGGTC GCTGTCGTGG AAGAGCACCT TTACCTCGAC
GCGCTCCTCG ACCTGGAGGT CGACGAACTG CGGGATCATG CCGCGCCGGC CGCCGGCAGC
CCTGCTGCCA GCCGGGACCG GGTGAGGATC GCGGTGGCGC GGGACGCGGC CTTCTGCTTC
GTCTACGAGG ACAACCTGCG GCTCCTGGAG CAAAGCGGGG CCGAACTCTG CTACTTCTCC
CCCCTCGCCG ACAGCTTGTT GCCGGAGGCT ATCGGCGGCA TCTACCTCCC CGGGGGGTAC
CCTGAGCTCT TCGCGGCGCG CCTTGCCGCC AACGAGCCGA TGAAACAGGA GATCCGGCAG
GCGGTGGAGG GGGGAATGCC CGTCTATGCC GAGTGCGGCG GGTTCATCTA CCTCACCCGT
GGGGTGGCCG CGGAGGGGGA AAGCCATGGC TTTGCCGGCA TCTTCCCGGT AGAGACCCGG
ATGCTGCCGC GCCGCAAGGC GCTCGGGTAC CGCGAGGTGG AACTGCTGGA AGATTGCACG
CTCGGCCGCA AGGGGAGCAT CGCCCGCGGC CACGAGTTCC ACTACTCCGA GATGCAGGAA
ATGCCCCCCA ACGTGGAGCG CCTGTACCGG GTCACCCGCA AGGGGGTGGA ACTCGCGCCC
GAAGGTTACC GTTACAAAAA CTGCCTCGCC TCTTACATAC ATCTACACTT CGGCAGCTCG
CCAGGCCTGG CTCCTCACTT CGTGGAACAG GGAAGGGCGT ACCAAAAAAG GAGCCTCACA
TGA
 
Protein sequence
MKRIVIAAPS SGCGKTTVTL GVMAALKRRG LKVAPFKVGP DFIDPGYHRV ATGVPSVNLD 
GWICDPHFLR ESFLHHAAAA DIAVVEGAMG LFDGIDGLSE SGSSAQVAKE LAAPVVLVVD
ARSQARSAAA LVHGFAGFDP ALRVAGVIFN NVASENHERI LREALGAAVP GVQVIGCLPR
DPALAIPSRH LGLVTVEDNP LSDPFLDHLV AVVEEHLYLD ALLDLEVDEL RDHAAPAAGS
PAASRDRVRI AVARDAAFCF VYEDNLRLLE QSGAELCYFS PLADSLLPEA IGGIYLPGGY
PELFAARLAA NEPMKQEIRQ AVEGGMPVYA ECGGFIYLTR GVAAEGESHG FAGIFPVETR
MLPRRKALGY REVELLEDCT LGRKGSIARG HEFHYSEMQE MPPNVERLYR VTRKGVELAP
EGYRYKNCLA SYIHLHFGSS PGLAPHFVEQ GRAYQKRSLT