Gene Strop_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0443 
Symbol 
ID5056882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp505742 
End bp506842 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content67% 
IMG OID640472716 
Producthypothetical protein 
Protein accessionYP_001157306 
Protein GI145593009 
COG category[S] Function unknown 
COG ID[COG3883] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTGCG TCCGCGCCAG CGGGAAGCAC GTCCAGCCCC AGCCCGTCAG CTCGCCGAGC 
CGGCGGTCAA CGGAAGGGAT CATTGTGACG GCACCCCCAC GCCGCTGGTT GACGCCGGTG
GTGGCCGTGC TCACCGCGCT GGCCGTGCTC ACCGGGCCGA TACCGGCCTC GGCCACCCCC
ACCTCCCCCC CGCTGCCCTC GGGGCACGAC GAGGAGCCGG AGCTACTCGG TGATCTCATC
GAGGTCCGCA ACCGCGAGTA CGTCAAGGCA AAGGCCCAGC TGGCGGAGTC CGAGAAGCGC
CAGGCCGCCC TCGAGAAGGA AATCGAGAAG GCGCAGGACG ACCTGGACGA ACTAGCCCCC
CAGGTGGCGC AGATCGCGAC CCAGTCGTAC CGCACGGGAC GGGTCGGCGC GATATCGATG
TTGCTGGAGG CAGACACCCC CGACTCCTTC ATCGTCCGGG CTACCGCGCT GGACGAGCTG
AACCGCGTCA ACGACCAGCG CATCAAAGCA GTCAACACAA TCAAGATCCA CGCTGAGCAG
TCGAAGGTGG CAGTCGACGA AGAGGTACGC AAGCAACAGA AGCTGAAAAG CGACCTCGAG
CGCGGAAAGC TCGAGGCGGA GAAGGCCCTC CGCCTCGTCG GTGGCAACGG GCTCACCGGC
GGCCTGGTTG ACGCCGAATC GCCGGTCGCC CGGGTCGCCC CGGGACGCAC CTCGGATGGC
GACTGGCAGC CGCTGGGCTG CACCGAGGAT GACCCGACCA CCGGCGGCTG CATCACAGCG
CGAACACTGC ACATGTACAA CGAGGTCAAG CGGGCCGGTT TTGACCGATT CGTCGGATGC
TACCGCTCGG GTGGGCCGTG GGAGCACCCC AAGGGACGGG CCTGTGACTG GTCACTGCAG
GACAGCGGGT TCCGCTCTTG GTACAACAAC GACATGCGCC TCTACGGCAA CAACCTGACC
GCGTTCCTGG TCCGTAACGC CGACCGGCTC GGCGTCTACT ACGTGATCTG GAACCGGCAG
ATCTGGTTCC CGGCAACCGG CTGGAAGTCG TACAACGGCC CGTCGAACCA CACCGACCAC
GTCCACGTGT CGTTGCTGTA G
 
Protein sequence
MGCVRASGKH VQPQPVSSPS RRSTEGIIVT APPRRWLTPV VAVLTALAVL TGPIPASATP 
TSPPLPSGHD EEPELLGDLI EVRNREYVKA KAQLAESEKR QAALEKEIEK AQDDLDELAP
QVAQIATQSY RTGRVGAISM LLEADTPDSF IVRATALDEL NRVNDQRIKA VNTIKIHAEQ
SKVAVDEEVR KQQKLKSDLE RGKLEAEKAL RLVGGNGLTG GLVDAESPVA RVAPGRTSDG
DWQPLGCTED DPTTGGCITA RTLHMYNEVK RAGFDRFVGC YRSGGPWEHP KGRACDWSLQ
DSGFRSWYNN DMRLYGNNLT AFLVRNADRL GVYYVIWNRQ IWFPATGWKS YNGPSNHTDH
VHVSLL