Gene Strop_3780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3780 
Symbol 
ID5060258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4331050 
End bp4332255 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content63% 
IMG OID640476038 
Producthypothetical protein 
Protein accessionYP_001160589 
Protein GI145596292 
COG category[S] Function unknown 
COG ID[COG2311] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.769148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGCCG TACGTCAAGA GGTAGCTGCT CGGGCGACTC CTGTCCGATC GCGCCTGCTG 
GGACCCGACC TGGCGCGCGG GGGCATGCTG TTGTTGATCG CGCTGGCCAA TGTTGATGTC
TTCGCTTTTG GCTTTCTTCC AGGATTCCGG GGGTACCCCG CCGAGCAGTC CGTACTCGAC
AGTATCTTCA CCATGGTGCG GATGGTCTTG GTTGACGGCC GTGCCTTTCC GCTTTTCGCC
GCGCTGTTCG GCTACGGCCT CACGCAGCTT ATACAGAGTC GGGCTTCTGT AGGATCCGGG
CGTTCTGCTC GGCTACTGCG TCGACGTGGC ATGTGGTTGG TGGCCATTGG ATTCGTTCAC
GGCATGCTGT TGTTTACGGC GGACATCATT GCTCTCTACG GATTGTGTGC CCTCGTATTC
GCTGGGCTCG TGGTACGTCT CAGTGACCGT GGCCTGCTGG CTGTGGCCCT GTCGCTGATG
GCGCTCGCCC TGCTCTCCGG TGCCGTCCGC GGACTACCCG CGGAGGCTCT CGGCCAGGCA
GAGGTAGTCA CGGCGACACC GACCGTATTC GGTGGTGAGG CCGTCGAGGC GCTGCAGGTG
AGAATGAGTG AGTGGGCCGT AGGGGCTATT CGTCTATTCG GACTGATGCC GGCGGTGCTC
TTCGGTGTCT ACGCGGGACG TAAATCAGTC TTAACCTGGG GTTCAGAGCG GAAACGGATA
CTCAGCCTGG TCGCGTTTGC CGGACTGGCG GCCGGCATCC TTGCAGGGGT TCCTTCGGCG
CTGATGGCGG CATCAGTGTG GAATGAGCCG TCGATTGGTA TCAGTGCGAT CGCGGGAACG
CTTCACCTGG CGGGCGGGTA TGCGGCCGCA GCCGGCTACC TGGCCCTGTT CGCCCTACTC
GCGGCCGCCG CGCGGCGGCC TCCAGGCCTG ACAGTGAAGG CGCTGTCGGT GAGTGGGCAA
CGCTCCTTGA CTCTGTACCT GAGCCAGTCC CTGCTGTTCC TCGTCCTCTT TGACCCGGAC
TTTTTTGGGC TGGGTGACAA CTTCGGTATT GCCCTGAACT CTGCTGTGGC TGTCGGTGTC
TGGACCGTTG GCGTGCTCGG TGCGCTGGTG ATGGACAGGC TGTCCGTTCG TGGCCCGGCC
GAGGTGCTGC TACGCAGTCT CACCTACCGG TCGGTGACCC GATCGGCTGG ACCACGTTCT
CGTTGA
 
Protein sequence
MVAVRQEVAA RATPVRSRLL GPDLARGGML LLIALANVDV FAFGFLPGFR GYPAEQSVLD 
SIFTMVRMVL VDGRAFPLFA ALFGYGLTQL IQSRASVGSG RSARLLRRRG MWLVAIGFVH
GMLLFTADII ALYGLCALVF AGLVVRLSDR GLLAVALSLM ALALLSGAVR GLPAEALGQA
EVVTATPTVF GGEAVEALQV RMSEWAVGAI RLFGLMPAVL FGVYAGRKSV LTWGSERKRI
LSLVAFAGLA AGILAGVPSA LMAASVWNEP SIGISAIAGT LHLAGGYAAA AGYLALFALL
AAAARRPPGL TVKALSVSGQ RSLTLYLSQS LLFLVLFDPD FFGLGDNFGI ALNSAVAVGV
WTVGVLGALV MDRLSVRGPA EVLLRSLTYR SVTRSAGPRS R