Gene Jann_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3047 
Symbol 
ID3935518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3076917 
End bp3078320 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content66% 
IMG OID637905418 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_510989 
Protein GI89055538 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0114628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.960594 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGTT TTCCGATGTT TTTCCGCACC AGCGGACGGC GTGTGGTGAT TGTGGGCGGA 
GGTGAGCAGG CCGCCCAGAA GGCGCGCTTG ATCCTGAAGA CGGACGCGCA GATCGTGCTG
GCCGCGCCGG AGTTGGACCC GGAATTGGAG GGCATAGTCG CATCAGGCCG TGCCATGCAC
CAAATGGGCC CAGTGACGGA CACCACCTTC ACCGGCGCGG CCATGGCGTT CATCGCCACC
GGCTGCCCCG GCTGGGACGC GAGCGTGCAC GCGCTGGCGC AGGCCGCGCG GTGTCCGGTG
AACGTCGTGG ACCGCCCCGA CCTGTGCGAC ATCACCACGC CGTCGATTGT GGACCGTGAC
CCGGTGGTCG TGGCGATCGG CACGGAAGGA ACCGCCCCGG TTCTGGGCCG CGAGATCAAG
ACCCGGGTCG AGCAGATGCT GCCGGTCAAC ATCGGCGGCT TGGCTGAGCT GGCAGGTCGC
CTTCGCCCCG CCGTCACCGC GCAGGTCCCT CGCGCCAAGC GTCGCGCGTT TTGGGCCTGG
GTGTTCAAAA GCACGCCCCG TACCACTTGG ACGCGGGGTG CGGAACGGGA CGGTGCGCGG
ATGATCAAGG AGGCCATCGC GCAGGGCGGC GCGCCTGACA CGCAGACCAA GGGCTCCATC
GCTTTGGTCG GTGCCGGACC CGGCGCGCGC GATCTGCTGA CCCTGCGCGC GGTGGAGCGT
CTGCAGGAGG CCGATGTCAT CTTCTACGAC CGCCTCGTCG ATCCGGACGT GCTGGAACTG
GCGCGCCGCG ATGCCGAGAG GGTTTTCGTC GGCAAACACG TCGGCGCGCA TGCCTGGCCG
CAAGCGCAGA TCAACGGGAT GATCGTGGCG GAAGCCCTGA AGGGGCGCCG TGTCGTGCGG
CTGAAATCCG GCGATCCGGG CATCTTTGGA CGCGCAGGCG AAGAGATTGA GGCCGCTGAG
ACGGCAGACA TCCCCATCGA GGTTGTGCCC GGCATCACCG CCGCCTCTGC CGCAAGTGCT
GCCATGGGCC AAAGCCTGAC TACGCGCGGG CGAACTGACA CACTGGTGCT GGCAACCGGC
ACCGGCAATC CCGACGCGCC GCTCCCCGAT TGCGTCCGCT TTGCAGGTCC CGGCACGACC
ACCGCAATCT ATATGGGGGT CCGCCACGTG GACCGTATCT GCGCAGCCCT GCAGGCGCGG
GGTTTCCCCG CAAACGCGAT GATAGATGTG TGCGTTGACG TGGAAAAGTG CACGCAGCGC
CTGTTGTCTG AACACGTGGC GACCCTGCCC ATCCGCCTGA AAGCCGCGAA GATCGAGGGC
TGCGCACTTC TACTGGTCAA ATGGCCATTG GTTGAGCAAG CAAACGTAGC TCCGGAACCA
ATCCTGATAG ACGCAATGGG CTGA
 
Protein sequence
MKSFPMFFRT SGRRVVIVGG GEQAAQKARL ILKTDAQIVL AAPELDPELE GIVASGRAMH 
QMGPVTDTTF TGAAMAFIAT GCPGWDASVH ALAQAARCPV NVVDRPDLCD ITTPSIVDRD
PVVVAIGTEG TAPVLGREIK TRVEQMLPVN IGGLAELAGR LRPAVTAQVP RAKRRAFWAW
VFKSTPRTTW TRGAERDGAR MIKEAIAQGG APDTQTKGSI ALVGAGPGAR DLLTLRAVER
LQEADVIFYD RLVDPDVLEL ARRDAERVFV GKHVGAHAWP QAQINGMIVA EALKGRRVVR
LKSGDPGIFG RAGEEIEAAE TADIPIEVVP GITAASAASA AMGQSLTTRG RTDTLVLATG
TGNPDAPLPD CVRFAGPGTT TAIYMGVRHV DRICAALQAR GFPANAMIDV CVDVEKCTQR
LLSEHVATLP IRLKAAKIEG CALLLVKWPL VEQANVAPEP ILIDAMG