Gene Strop_2999 details


Gene Information

Locus tag: Strop_2999
Symbol:
ID: 5059463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3429571
End bp: 3430581
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 72%
IMG OID: 640475250
Product: regulatory protein, MerR
Protein accession: YP_001159815
Protein GI: 145595518
COG category: [C] Energy production and conversion
              [K] Transcription
COG ID: [COG0789] Predicted transcriptional regulators
        [COG1413] FOG: HEAT repeat
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.645753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATCG GCGACGTGGC GCGCCGCTCG GGAGTGAGCA CCCGCATGCT TCGGCACTAC 
GACGCACTGG GGCTGGTCCG ACCGACGGGT CGTACCTCGG GCGGCTACCG CGAATACTCG
GACGAGGACG TACGGCGGCT GTTCCAGGTG GAGAGCCTGC GGTCACTGGG GCTGTCGCTG
CGCCAGATCA CCCGGGCGCT CCAGGATCCC ACCTTCACAC CGGCCGGCCT GGTCGGCGAC
CTCATCACCG TGACCGAAGA GCGGCTGGAG CGGGAACGGG AGCTGCTCGA CCGGCTTCGT
ACGGTCGATG CCGCGGCGCC CTCCGGATGG TCGGGGGTCC TGCGCATCAT CGAGCTCATA
ACCGGGCTCA ACTCACCCAG TGCCGCCCTG CGTCAGCAGA GTGTTCTGGC CCCCGCCGAG
GAGGCACGGG TGCCCGCCGA GTTGCTGGCC GGCGCGGTCC TCGCCGAATC CGACCCCAAC
GTGGCCGGCG CGCTGCGATG GGCGCTCGCT CGGGCGGGCG GTGACCCCCT GGCGAGCCTG
GCCTCCGGTG TCCACTCCGA GAACGTGGAC ATCCGCCGCC GCGCGATCCA GGCGGTCGCC
GCCCTGACCG GTGACGAGGC GACAGCAGCG CTCGTGAACG CCCTCGGCGA CCCGGACCCG
GCGGTCCGCC GACACGCGGC CCTGGCGCTG GGCCGGCGTG GCGAGGTGGC GGCCGTGCCC
GCACTTGTCG ACCTGGTGGT CGAGGGCGGG CACGACGTCC AGGCGGCCGA ACTCCTGGGG
GCCCTGTCGG AGGACCCGGC CCGCGCGGAA CAGATCGTCA GCGCCCTCTC CGACGAGCTC
GCCGCCCCTA CCGCGGACTC CGCCGTCCGG AGCCGGCTCA CCCAGGCCAT CCTGGAGCTG
CCCCGGACCG TCGCGCAGCC CGTCCTGCGA CGGTTGTCCC ACGACGATGA CCCGGTGGTG
GCACTGACCG CCGCGGCCTA TCTGGAAGAC GAAGACTCCA CGTCTCGTTA A
 
Protein sequence
MLIGDVARRS GVSTRMLRHY DALGLVRPTG RTSGGYREYS DEDVRRLFQV ESLRSLGLSL 
RQITRALQDP TFTPAGLVGD LITVTEERLE RERELLDRLR TVDAAAPSGW SGVLRIIELI
TGLNSPSAAL RQQSVLAPAE EARVPAELLA GAVLAESDPN VAGALRWALA RAGGDPLASL
ASGVHSENVD IRRRAIQAVA ALTGDEATAA LVNALGDPDP AVRRHAALAL GRRGEVAAVP
ALVDLVVEGG HDVQAAELLG ALSEDPARAE QIVSALSDEL AAPTADSAVR SRLTQAILEL
PRTVAQPVLR RLSHDDDPVV ALTAAAYLED EDSTSR
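
The fields in this record can be cross-checked against one another. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `translate`) are hypothetical helpers introduced for illustration. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits).

```python
# Standard codon assignments, enumerated in TCAG order.
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def translate(dna):
    """Translate a DNA string codon by codon; whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
print(gene_length(3429571, 3430581))    # compare with "Gene Length"
print(expected_protein_length(1011))    # compare with "Protein Length"
print(translate("ATGCTGATCG GCGACGTG"))  # first six codons of the gene
print(translate("TCTCGTTAA"))           # last three codons of the gene
```

Passing the full gene sequence to `translate` and stripping the trailing `*` should give a string that can be compared directly with the protein sequence listed above.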