Gene Gdia_2235 details


Gene Information

Locus tag: Gdia_2235
Symbol:
ID: 6975664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2479041
End bp: 2480453
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 75%
IMG OID: 643391762
Product: uroporphyrin-III C-methyltransferase
Protein accession: YP_002276605
Protein GI: 209544376
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain)
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
            [TIGR01470] siroheme synthase, N-terminal domain
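
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions, although they agree with the values listed here. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2479041, 2480453)
print(length)                  # 1413
print(protein_length(length))  # 470
```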


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATG CGACCGACCC CGGCTGGTTC CCGCTGGCGC TGCGGCTGCG CGGCGCGCGG 
GTGGTGGTGG TGGGCGGCGG CGGGATCGCG CTGAACAAGG TCCGGCTGCT GCTGGCCCAC
GCCGCGCGGA TCGACATCCT GGCCCCCCGG CTGGAGGACA CGCTGGCCGC CTGGCAGGCC
GAAGGGCGGA TCACCCACAT CGCGGGCGAG GCGACGCCCG ACCGGGTGCG TGCGCTGCTG
CCCGGCAGCC GCCTGGTCTA TGCCGCGACC GACGACCGGG CGGTGAACCG CGCCGTCGCG
GCGCAGGCCG ATGCGCTGAA TATCCCGGTC TGCGCGGTGG ACGACCCGGA GCCGTCTTCC
TTCATCACGC CCGCGCAGAT CCATCGCGGG CCGGTGCGGA TCGCGATTTC CACCGGCGGC
GCGGCCCCGG TGCTGGCCCG GCGCCTGCGC GAGCGGATCG AGGCCGTGAT GCCGGCCGGG
CTCGACGCGC TGGCGCGCTT CCTGCAGGCC GAGCGCGCTC ATGTCGTGGC TGCCTGCCCC
GATATCGGCC GCCGCCGCCG GGTATGGGAG GATTTCCTGG ACGGCCCAGG CGGCGAGGCG
GCGCAGCGCG GCGAACACGC GGCCGCGCGA CAGGTACTGG ACCACCTGCT GGCCGGCGCG
CAGACCGGGG GCGAGGTCTG GCTGGTCGGC GCCGGGCCGG GGGACCCGGA CCTGCTGACC
CTGCGGGCGC TGCACCTGAT GCAGAACGCG GATTCGGTGC TGTACGACCA GTTGCTGCCG
CCCGCGTTGA TGGACCGGGT GCGCCGCGAT GCCGAGCGGG TGTTCGTGGG CAAGCAGCGC
GACCGCCACA CCATGCCGCA GGACGACATC AATGCCGAAC TGATCCGCCG CGCGCGGGCG
GGCGAGCGGG TGCTGCGCCT GAAGGGCGGG GACCCGTTCA TCTTCGGTCG CGGCGGCGAG
GAGATCGAGG CCCTGATGGC GGCGGGAATT CCGTTCCAGG TCGTGCCGGG CATCACGGCG
GCCAGCGGCT GCGCCGCCTA TGCCGGCATT CCGCTGACCC ACCGGGACTG CGCCCAGTCC
TGCCTGTTCG TCACCGGTCA CGCCCGCCGC GACGGCACGC TGGACCTGCC GTGGGACAGC
ATGGCCCGGC CGGGGCAGAC CATCGCGATC TATATGGGCG TCACCGCGCT GCCGGACCTG
TGCACCATGC TGGTGCGCCA CGGCCTGCCG CCCGACTGGC CCGCCGCCGT GGTGGAGCGC
GGCACCCGGC CCGACCAGCG CGTGCTGACG GGAACCCTGG CCGACCTGCC GGCGCTGGCG
CGCGCCCATG CCGTGGGCAG CCCGGCGCTG GTGCTGGTGG GCCAGGTGGT GCGGCATCGC
GTCGTCACGC CGCCGCCCCT GTCCGGTACG TGA
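
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch, with hypothetical helper names, of cleaning such a block and measuring its GC content is shown below. The example uses only the first displayed line of the sequence, so its GC fraction is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; an empty sequence gives 0.0."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = (
    "ATGAGCGATG CGACCGACCC CGGCTGGTTC "
    "CCGCTGGCGC TGCGGCTGCG CGGCGCGCGG"
)
seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(round(gc_fraction(seq), 3))   # 0.783 (47 of 60 bases)
```

Pasting the full sequence block into `clean_sequence` in the same way gives the string needed to check the gene length and GC content fields.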
 
Protein sequence
MSDATDPGWF PLALRLRGAR VVVVGGGGIA LNKVRLLLAH AARIDILAPR LEDTLAAWQA 
EGRITHIAGE ATPDRVRALL PGSRLVYAAT DDRAVNRAVA AQADALNIPV CAVDDPEPSS
FITPAQIHRG PVRIAISTGG AAPVLARRLR ERIEAVMPAG LDALARFLQA ERAHVVAACP
DIGRRRRVWE DFLDGPGGEA AQRGEHAAAR QVLDHLLAGA QTGGEVWLVG AGPGDPDLLT
LRALHLMQNA DSVLYDQLLP PALMDRVRRD AERVFVGKQR DRHTMPQDDI NAELIRRARA
GERVLRLKGG DPFIFGRGGE EIEALMAAGI PFQVVPGITA ASGCAAYAGI PLTHRDCAQS
CLFVTGHARR DGTLDLPWDS MARPGQTIAI YMGVTALPDL CTMLVRHGLP PDWPAAVVER
GTRPDQRVLT GTLADLPALA RAHAVGSPAL VLVGQVVRHR VVTPPPLSGT
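
The protein sequence corresponds to a translation of the gene sequence under translation table 11. Table 11 assigns the same amino acid to each codon as the standard genetic code and differs from it only in the set of permitted start codons, so a standard codon table is enough for a gene that starts with ATG. The following is a minimal sketch with a hypothetical `translate` helper, not the database's own pipeline; it stops at the first in-frame stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a DNA block (whitespace allowed) up to the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCGATG CGACCGACCC CGGCTGGTTC"))  # MSDATDPGWF
# Last 33 bases, ending in the TGA stop codon.
print(translate("GTCGTCACGC CGCCGCCCCT GTCCGGTACG TGA"))  # VVTPPPLSGT
```

Both printed fragments match the start and end of the protein sequence listed above.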