Gene Strop_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1598 
Symbol 
ID5058056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1818904 
End bp1819959 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content72% 
IMG OID640473871 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001158442 
Protein GI145594145 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.46562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATGTTCT CTCCTGTTCA TCACTTTCGG GACGCCGCCG TCAGCGCGGC GCTCGGCTTC 
GGTGACCTGG TGGCCGATCC GGTGTGGCTG CGTCCGATCC TGGTGGAGAG GGACCTGCTC
ATCCTGGTCA CCGCGGGGCA TGGCCGCGCC GAGGTGGACT TCCACACCCT GTCCTGCCGC
CCGGGCACAC TGCTGCGGGT CCGGGCCGGC CAGGTGCTTC GGTGTGGACC ATCCAGCCTT
GGTGCGATCG TGGTGCACTG GACGTCGACG GCGCTGCACG GATTCGACGT TGCCCCCGAG
GCGGCCCCGG CCTGGCTTGA GCTGACCGGT GCGGACGGGG CGACCATCAG CACCGGGGTC
CATCAGCTCG CGGCGGACTG CGAGCGGCAT CGCGGGGCGC CCGCCGCACT GTTCCGCCAC
CAGCTGGCGG CGCTGCTGCT ACGGCTGGCC CTGCTGGTGG ACTCCGGCCG GGGGTCTCAG
CCCGCGCCGC GGTCGGCGTC GCGCACCGAG ACCAACACGT TCCGACTGCT CTGCCGGGAG
TTGGAGCAGG GCTACCAGCG CAGCCGACGG GTGGAGGACT ACGCCGACCA ATTGGGCTGC
TCCGTTCGTA CCCTGACCCG CGCCTGCCTG GCGGTCACCG GGCGCAGCGC GAAGCAGGTG
GTGGACGAGC GGGTGGCGTT GCAGGCCCGC CGCCTCCTCG CGGCGACCGA CGAACCGGTG
GCGCGAGTAG GCCAGCGGCT CGGTTTCTCC GAGCCGACCA ACTTCGGCCG GTTCTTCACC
CGGGAGGTCG GGGTCAGTCC GGGAGCGTAC CGCGCCGCTT GGGAGCACCC CGCCGACCAC
TCGACGCCGA CCGAACCGGA CTCGGCACCC ACCCTGCCCG CCCGGGACGC TCCGTCCCTG
GTACGCCCGC GCCCACCCGC CGACGCCGAC GACGGTCAGT CACAGATGCC GGGCCACAGC
GATGACCACG TCACCGCCGA GTCGGCCGTG CTCGGGGTCG CGAACAGCGG CGCCGCATCG
CAGCAGCCCG GTGAGCGCCC GCTGCACGTC GACTGA
 
Protein sequence
MMFSPVHHFR DAAVSAALGF GDLVADPVWL RPILVERDLL ILVTAGHGRA EVDFHTLSCR 
PGTLLRVRAG QVLRCGPSSL GAIVVHWTST ALHGFDVAPE AAPAWLELTG ADGATISTGV
HQLAADCERH RGAPAALFRH QLAALLLRLA LLVDSGRGSQ PAPRSASRTE TNTFRLLCRE
LEQGYQRSRR VEDYADQLGC SVRTLTRACL AVTGRSAKQV VDERVALQAR RLLAATDEPV
ARVGQRLGFS EPTNFGRFFT REVGVSPGAY RAAWEHPADH STPTEPDSAP TLPARDAPSL
VRPRPPADAD DGQSQMPGHS DDHVTAESAV LGVANSGAAS QQPGERPLHV D