Gene Sare_2699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2699 
Symbol 
ID5708373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3072472 
End bp3074124 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content74% 
IMG OID641272157 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_001537527 
Protein GI159038274 
COG category[H] Coenzyme transport and metabolism
[S] Function unknown 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG2138] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.2804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000140912 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCGCAC TGGTCATCGT CGGGCACGGC ACCCGTAGCG CCGAGGGGGT CGCGCAGTTT 
GCCGCACTGG TCGAGCGGGT TCGTACCCGT GCGGCCGGTA CGGTCGGCGA CGTCGAGGGC
GGCTTCATCG AGTTGTCCCG CCCACCACTG ACCGACGCGG TCGGTGCGCT CGCCGCCCAG
GGGCACCGGG CACTGGTGGC GCTGCCGCTG GTGCTCACCG CCGCCGGTCA CGGCAAGGGC
GACATCCCCG CGGCGCTGGC CCGCGAACAG CAGCGCCACC CCGGTCTGTC ATACGTGTAC
GGCCGCCCGC TCGGCCCGCA CCCGCTGCTG CACACCGTCC TCGAGGAGCG GATCGACGCA
GCGCTGGCCG GTGCCGACCG GGCCGGCACG TGGGTGGCGT TGATCGGGCG AGGGTCCACC
GACCCGGACG CGAACGCCGA GGTCGCCAAG GTGGCCCGCC TGTTGTGGGA GGGGCGGGGC
TACGCGGGTG TGGAACCGGG TTTCATCTCG CTCGCCGAGC CGTCCGTGCC GGCGGTCCTG
GACCGGCTGC GCCGGCTCGG GGCGCGGCGG ATCGTGGTCG CTCCGTACTT TCTGTTCGCC
GGGGTGCTCC CGGATCGGAT CCGGGCCCAG TCGCAGGAGT ACGCCGCCGC TTATCCGGAG
TTGGACCTGC GGGTGGCGGA TCTGATCGGG GACTGTGACG CCCTCGCCGA CCTGGTGCTG
GAACGCCGTG CCGAGGCGGT GCGCGGCGAC ATCCGGATGA ACTGCGACAC CTGCGCGTAC
CGGGTGTCGA TGCCCGGCTT CGTGGACAAG GTGGGCCGGC CGCAGACCCC GCACGACCAC
CCGGACGACC CGACGGGCGG GCACACCCAC ACCCACCACC ATCACCCCGA GCCCGTGTTG
CGGCCGGGTG AGGTGGCGGT GGTCGGGGGC GGGCCGGGCC CGGACGACCT GATCACCGTC
CGGGGCAGAG CACTGCTCGA CGTCGCCGAC GTGGTGGTTG TCGATCGGCT CGCCCCGCAG
GGTCTGCTCG TTGGGCTGCG CCCGGGGGTG ACCGTGGTGG ACGCGGCGAA GTCACCCCGG
GGGCCATCCG TCGGGCAGGA CGACATCAAC ACCGCACTGG TCCGGCACGC CCGCGCCGGC
CGGCGGGTGG TCCGGCTCAA GGGCGGTGAC CCGTACGTCT TCGGGCGCGG GCACGAGGAG
GTGTTGGCCT GCGCGGCGGC CGGGGTACCA GTGACCGTGG TGCCGGGGGT GACCAGCGCG
GTGGCCGCGG CGGCCCTGGC CGGGGTGCCG GTCACCCATC GCGGGACGGC GCACGAGTTC
ACCGTCGTGT CCGGGCACCT GCCGCCGAGG CACCCGGACT CGCTGGTCGA CTGGGTGGCG
TTGGGTCGGG CCCAGGGCAC CCTGGTGGTG CTGATGGGTG TCGACACGAT CGGGGACATC
GCGGCGGAGC TGATCGCGCA CGGGCGCGCC CCGGACACCC CGGTCCTCGC CGTGCAGGAT
GCCGGTCATC CGGAGCAGCG TTCACTGCCC GCGCGCCTGG ATGGGATCGC TGAGGTGGCT
GTCCGGGCGG GAGTGCGTCC ACCGGCGGTC TTTGTGGTTG GCCCGGTCGC GGCGTTCGCC
GAGGTGCCCG TCCCGGTAGC GTCCGGTCGG TGA
 
Protein sequence
MTALVIVGHG TRSAEGVAQF AALVERVRTR AAGTVGDVEG GFIELSRPPL TDAVGALAAQ 
GHRALVALPL VLTAAGHGKG DIPAALAREQ QRHPGLSYVY GRPLGPHPLL HTVLEERIDA
ALAGADRAGT WVALIGRGST DPDANAEVAK VARLLWEGRG YAGVEPGFIS LAEPSVPAVL
DRLRRLGARR IVVAPYFLFA GVLPDRIRAQ SQEYAAAYPE LDLRVADLIG DCDALADLVL
ERRAEAVRGD IRMNCDTCAY RVSMPGFVDK VGRPQTPHDH PDDPTGGHTH THHHHPEPVL
RPGEVAVVGG GPGPDDLITV RGRALLDVAD VVVVDRLAPQ GLLVGLRPGV TVVDAAKSPR
GPSVGQDDIN TALVRHARAG RRVVRLKGGD PYVFGRGHEE VLACAAAGVP VTVVPGVTSA
VAAAALAGVP VTHRGTAHEF TVVSGHLPPR HPDSLVDWVA LGRAQGTLVV LMGVDTIGDI
AAELIAHGRA PDTPVLAVQD AGHPEQRSLP ARLDGIAEVA VRAGVRPPAV FVVGPVAAFA
EVPVPVASGR