Gene Strop_3037 details


Gene Information

Locus tag: Strop_3037
Symbol:
ID: 5059501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3468422
End bp: 3469504
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 68%
IMG OID: 640475287
Product: chitin-binding domain-containing protein
Protein accession: YP_001159852
Protein GI: 145595555
COG category: [S] Function unknown
COG ID: [COG3397] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
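
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions are inferred from the listed values rather than stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(3468422, 3469504)
print(length_bp)                  # 1083
print(protein_length(length_bp))  # 360
```

Both results match the Gene Length and Protein Length fields, so the 361st codon is the stop codon.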


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.245182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.198398
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCCT ATCGAGCCCG AACAGCCGCG CTCCTCACCG CGGCCACGAC TCTCCTCGCG 
GCTGCCGCGG TACTCACCGT CAGGTCGGAG CCGGCGGCGG CGCACGGCGC CGCCATGGTG
CCCGGCAGCC GTACCTTCCT TTGCTGGCAG GACGGGCTGA GCCCCACCGG GGAGATCCAA
CCGTACAACC CCGCCTGCTC GGCGGCGGTG GACCAGAGTG GGGCGAACTC GCTCTACAAC
TGGTTCAGTG TGCTGCGCTC CGACGCGGAT GGTCGTACCG TCGGGTTCAT TCCCGACGGC
CAGCTGTGCA GCGGGGGAAA CCCCGGGTTC CTCGGCTATG ACCTGGCCCG CATTGACTGG
CCACTGACGC ACCTGACCGC TGGCCAGAAC ATTGAGTTCC GCTACAGCAA CTGGGCGCAC
CACCCCGGGA CGTTCTACTT CTATGTCACC AAGGACAGTT GGAGCCCAAC CCGTCCGCTG
GCCTGGAGCG ACCTGGAGGA GCAACCATTC CTGACCGTCA CCAACCCACC CCAGCGCGGC
GGTCCGGGCA CCGATGACGG GCACTACTAC TTCGCCGGAA CGCTGCCGGC CGACAAGAGC
GGCCGACACC TCATCTACTC GCGCTGGGTC CGTTCGGACA GCCCGGAGAA CTTCTTCGGC
TGCTCGGACG TCACGTTCGA CGGAGGCAAT GGTGAGGTGA CCGGCATCGG CCCCGGTGGC
ACCGCCCCGC CACCGAGCCC GACCACTGCG CCGCCGAGCC CGACCACGCC GCCGCCCAGC
GGGGACTGCA TGGCGGTCTA CAAGGTGATC AACGCATGGC CGGGCGGCTT CCAGGGGGAG
GTCGAAATCA TGAACCACGC CGCCACCACC TGGGCCGGGT GGACGGCACG TTGGACCTGG
CCCAGCGGCC AGTCAATAGT CCAACTCTGG AGTGGCACGC ACACCACCTC AGGCTCAGCG
GTAGCGGTGA CCAACGCGTC ATACAACGGC ACGTTGGCAC CGGGAAGCAG GGCCACGTTC
GGCTTCCTCG CCAGCCTCAG CGGCACGAAC ACGTCACCGA CCGTGACCTG CACCGGTAGC
TGA
 
Protein sequence
MSSYRARTAA LLTAATTLLA AAAVLTVRSE PAAAHGAAMV PGSRTFLCWQ DGLSPTGEIQ 
PYNPACSAAV DQSGANSLYN WFSVLRSDAD GRTVGFIPDG QLCSGGNPGF LGYDLARIDW
PLTHLTAGQN IEFRYSNWAH HPGTFYFYVT KDSWSPTRPL AWSDLEEQPF LTVTNPPQRG
GPGTDDGHYY FAGTLPADKS GRHLIYSRWV RSDSPENFFG CSDVTFDGGN GEVTGIGPGG
TAPPPSPTTA PPSPTTPPPS GDCMAVYKVI NAWPGGFQGE VEIMNHAATT WAGWTARWTW
PSGQSIVQLW SGTHTTSGSA VAVTNASYNG TLAPGSRATF GFLASLSGTN TSPTVTCTGS
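
The gene and protein sequences above are laid out in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be re-derived from the gene sequence. It uses only the standard library; the function names are hypothetical, and the codon table covers the amino acid assignments only (translation table 11 shares these with the standard code and differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCCTCCT ATCGAGCCCG AACAGCCGCG"
print(translate(first_blocks))  # MSSYRARTAA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.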