Gene Strop_3451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3451 
Symbol 
ID5059920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3957818 
End bp3959212 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content71% 
IMG OID640475700 
ProductCBS domain-containing protein 
Protein accessionYP_001160260 
Protein GI145595963 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.553184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0782726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCT CTCTTGCCGC TGCGACGGCC GGCGCTGCCG GCCTCCCCGA CGTGCAACTG 
GTCTTCCTCG CGGCCGGGCT GGTGGTCCTC GCCGGCCTGA TTGGGATGAC CGAGGCGGCG
CTCGCCGCCG TCTCTCCGGC CCGCGCGGCC GAGTTGGTCC GCGACGGAGC CCGGGGTGCC
CGAGCGTTAC AGTCCGTCGC CGGTGATGTC GTCCGGCACC TCAACCTACT GCTGCTGTTG
CGGCTGCTCG CCGAGCTGAG CGCGACCACC CTGGTGGCTC TGGTCGCGGT CGACTCGCTC
GGCGCGGGCT GGCGGGCCGC GGTGGTCACG GCCGGGGCGA TGACCGTGGT CAGCTTCGTG
GTGGTCGGCG TCGGGCCGCG CACCATCGGG CGTCAGCACG CCTATGCGGT GGGGCGTGGG
GTGGCGCCGC TGGTGCGTTG GCTGGGTCGG GCCCTCAACC CCCTCGCCTC TCTGTTGATC
CTGATCGGTA ATGCGGTCAC GCCGGGGCGC GGCTTCCGGG AGGGTCCGTT CGCCACCCAG
GTGGAGTTGC GCGAGCTGGT CGACCTGGCC GAGCAGCGCG GCGTGGTGGA GCATGGCGAG
CGGCAGATGA TCCATTCCGT CTTCGCGCTC GGTGACACCA TCGTCCGCGA GGTGATGGTG
CCGCGCACCG AGATGGTCTG GATCGAGCGC CACAAGATGC TCTCCCAGGC CCTGGCGCTC
TTTCTGCGTT CCGGCTTCTC CCGCATCCCG GTCATCGGCG AGAGCGTCGA CGATGTGCTC
GGCGTGCTCT ACCTGAAAGA TCTGATTCGG CGTACGCAGG GCGGCGCCCC GGAGGACCGG
CGTCTCCAGG TAGCCGAGTT GATGCGCCCG GCCACCTTCG TGCCGGAGTC CAAGCCGGTT
GACGACCTTC TCTCGGAGAT GCAGGCTGCC CGGAACCACC TGGTGATCGT CGTTGACGAG
TACGGCGGGA CGGGCGGCCT GGTCACCATC GAGGACATCC TGGAGGAGAT CGTCGGCGAG
ATAACCGACG AGTACGATGT TGAGCGTCCG CCGGTCGAGC ACCTCGACGA CGACGCGGTG
CGGGTCACCG CGCGGCTACC CGTGGAGGAC CTCGGCGAGT TGTTCGACAC CGAGCTGCCC
AGCGATGAGG TGGAGACGGT GGGCGGCCTG CTCGCGCAGT CCCTGGGCCG GGTTCCGATC
CCCGGCGCCC AGGTTGAGGT GGCCGGTCTA CGGCTGCTCG CGGAGGGCAC CACCGGGCGG
CGCAACCGGA TCGACACGGT CCTGGTGCGC CGGGTGGAGC CGGCCGACCA ACAGCACGAT
CCGGGGCGCG GCGACACCGC CGACGTCCCG GGCGACACCG ACCAAGCCGA GGAGAGGCAG
CCCGCCGATG CCTGA
 
Protein sequence
MMASLAAATA GAAGLPDVQL VFLAAGLVVL AGLIGMTEAA LAAVSPARAA ELVRDGARGA 
RALQSVAGDV VRHLNLLLLL RLLAELSATT LVALVAVDSL GAGWRAAVVT AGAMTVVSFV
VVGVGPRTIG RQHAYAVGRG VAPLVRWLGR ALNPLASLLI LIGNAVTPGR GFREGPFATQ
VELRELVDLA EQRGVVEHGE RQMIHSVFAL GDTIVREVMV PRTEMVWIER HKMLSQALAL
FLRSGFSRIP VIGESVDDVL GVLYLKDLIR RTQGGAPEDR RLQVAELMRP ATFVPESKPV
DDLLSEMQAA RNHLVIVVDE YGGTGGLVTI EDILEEIVGE ITDEYDVERP PVEHLDDDAV
RVTARLPVED LGELFDTELP SDEVETVGGL LAQSLGRVPI PGAQVEVAGL RLLAEGTTGR
RNRIDTVLVR RVEPADQQHD PGRGDTADVP GDTDQAEERQ PADA