Gene Sare_0427 details

Gene Information

Locus tag: Sare_0427
Symbol:
ID: 5708404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 487935
End bp: 488903
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 73%
IMG OID: 641269952
Product: porphobilinogen deaminase
Protein accession: YP_001535347
Protein GI: 159036094
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
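
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 487935
end_bp = 488903
gene_length_bp = 969
protein_length_aa = 322

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and leaves one codon more than the listed protein length, which is the stop codon.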


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.751162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00242186
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGTCC CCCTACGCCT CGGCACCCGG GGCAGCGCCC TGGCGATGGC CCAGTCCGGG 
CAGGTCGCGC AGGCCCTCAC CGCGGCCACC GGCAACCCGG TGGAACTGGT CGAGGTGGTG
ACTGCCGGCG ACCGTTCCGC GGCCCCGGTG CACCGGCTCG GCGTCGGCGT GTTCGTGTCC
GCGCTGCGGG ACGCCCTCAC CGCGCGCACC ATCGATTTCG CGGTGCACTC GTTCAAGGAC
CTGCCCACCG CGGCGGCTCC CGGCCTGCAT GTCGCGGCCG TGCCACCCCG GCAGGATCCC
CGGGACGCGC TCGTCGCCCG TGCCGGCCGT ACGCTCGCCG AGCTGCCGCC CGGCGCCCGG
GTCGGCACTG GTGCGCTGCG TCGTATCGCC CAGCTACACG CGCTCGGGCT TCAGCTCGAC
GTCCAGCCGA TCCGGGGTAA CGTCGGCACC CGGCTGGCGC GGGTGCTCGG GCCGGAGGCC
GACCTGGACG CGGTTGTCCT GGCCCGGGCA GGGCTGGCCC GGCTGGATCG CACCGACACG
ATCACCGAGA CGCTCGACCC GATGCTGATG CTGCCCGCGC CCGCCCAGGG GGCGCTGGCG
GTGGAGTGCC GGTCCGACGA TTCGGACCTG GTCGAGCTGC TCGCTGTACT CGACCACGCA
CCGTCCCGCG CCACGGTCGT CGCGGAGCGG GCGTTGCTTG CCACCCTGGA GGCCGGGTGC
AGTGCCCCGG TCGCCGCCTA CGCCGAACTA GCCGAGGGCG ACGTCGGTGA AGAGATCTAC
CTGCGCGGGG CGGTGATCAG TCCGGACGGC ACGCGTGACC TCCGACTGTC TCGCACCGGA
ACGCCCGCCG ACGCGGTGGA GATCGGTAAG GCCCTCGCCG CTGATCTTCT CGAACTCGGC
GCCGACTCGA TCCTCGGCCA GGAAGGACAC TTCGGCCCGG GGACCCAGCA ATTTGGGAGC
ACAGTATGA
 
Protein sequence
MSVPLRLGTR GSALAMAQSG QVAQALTAAT GNPVELVEVV TAGDRSAAPV HRLGVGVFVS 
ALRDALTART IDFAVHSFKD LPTAAAPGLH VAAVPPRQDP RDALVARAGR TLAELPPGAR
VGTGALRRIA QLHALGLQLD VQPIRGNVGT RLARVLGPEA DLDAVVLARA GLARLDRTDT
ITETLDPMLM LPAPAQGALA VECRSDDSDL VELLAVLDHA PSRATVVAER ALLATLEAGC
SAPVAAYAEL AEGDVGEEIY LRGAVISPDG TRDLRLSRTG TPADAVEIGK ALAADLLELG
ADSILGQEGH FGPGTQQFGS TV
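
The gene and protein listings above are related by translation table 11, and the GC content is a property of the gene sequence. The following is a minimal sketch of how one might strip the 10-letter block layout, compute a GC fraction, and translate the DNA. It assumes that table 11 assigns the same amino acids as the standard code (differing only in permitted start codons) and that the sequence contains only A, C, G and T. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, not part of the source database.

```python
# Compact standard genetic code, bases ordered T, C, A, G at each codon position.
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT letters
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and the final block of the gene sequence listed above.
print(translate("ATGAGCGTCC CCCTACGCCT"))
print(translate("ACAGTATGA"))
print(gc_fraction("ATGAGCGTCC"))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and GC content.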