Gene Strop_2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2983 
Symbol 
ID5059447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3411952 
End bp3413355 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content66% 
IMG OID640475234 
Productextracellular solute-binding protein 
Protein accessionYP_001159799 
Protein GI145595502 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.105062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTA CCCCCGATCT CAACCGCCGG AGCCTGCTGC GTCGCGCCGC CGCCGCGGGT 
CTGCTGACCC TCCCGGCCGC CGGCCTGCTC AGCGCCTGCG CCGGCAGCGA GCCAGCCCAG
GACGACAACT CCGGTGCCGC GAAGACCAAG GACAACCCGT TCGGTGTCCA GGACGGCAGC
TCCGTCAAGG TGGTCATCTT CAACGGCGGG CTGGGCGACC AGTGGGCCAA GGAGGACGAG
GCCGTCTTCA AGGCCAGGCA TCCGAGCGTC ACGGTCAACA TGTCCTCGAC CCAGAAGATC
AAGACCGAAG AGCAGCCGAA GATGGCGACC CAGCCCAGCG ACGTCGTCAT GAACTCCGGC
GCCGACAGTA TGGACATCAG CACCCTGGTC AACGAGGGCG CGATCGAGCC GCTGGAGGAC
CTGCTCGCCG CCCCGGCGTG GGACAGCGAG GGCACGGTGG CGGACACCCT GCTGCCGGGG
ACCGTCAACG ACGGCACCTT CCAGGGCAAG TTCTACGTGG TGAACATCGC GTACACGGTC
TGGGGTAACT GGTACAACGC CGCCCTCTTC GACAAGGAGG GCTGGCAGCC GCCGAAGACC
TTCGACGAGT TCTTCGCCCT CGCGCCGAAG ATCAAGGCGA AGGGCATGGC CCCGTACGTC
TACGACGCGG TGCACGGCTA CTACCCGCGG TGGGCGCTGA TGGCGACGAT CTGGAAGTCC
GCCGGCAAGC AGGCCGTGAT CGACATCGAC AACCTCAAGG AGAACGCCTG GAAGGCTGAC
GGGGTGCTAC CGGCCCTCGA GGCGTGGGAG AAGCTGGTCA AGGACAAGCT GCTGCTCCCC
GGCCAGCTGG ACCACACCCA GTCGCAGCAG GCGTGGCTCG ACGGCAAGGC CGCTTTCATC
CAGGTCGGCA CCTGGCTCAA GAACGAGATG GCGGAGACCA TCCCCCCGGG CTTCGAGATG
ACGCTGTCGG ACTACTGGAG CCTGGGGGCG GGCGACAAGG CACCGAACGA CGTCTACGCC
GGTGCGGGCG AGAACATCGT CGTGCCCTCG AAGGCGCCGA ACAAGGCCGC TGCCAAGGAG
TTCCTGCGGG CGGTGCTCTC CAAGGAGGGC TCGGCGAAGT TCGCTGAGCT GACCAAATCC
CTCGCCTCCA CCAAGGGCTC CGGGGACAAC GTCGAGGACT CGGCGCTGGC CAGCGCGAAC
GAGCTGATGC GCAACGCACC ACAGGATCTC GTCTCGTTCA AGTTCTGGAA CTTCTACGCC
GACCTGGACA AGGAGAGCCA GAACCTCTCC GCCGAGCTGA TGGCCGGCCG GATGACCGCC
CAGCAGTTCG TCGACGGTAT GCAGGGCGCC GCTGACAAGG TCGCCAAGGA CTCGTCGATC
AAGAAGCAGA CCCGCTCCGC CTGA
 
Protein sequence
MSATPDLNRR SLLRRAAAAG LLTLPAAGLL SACAGSEPAQ DDNSGAAKTK DNPFGVQDGS 
SVKVVIFNGG LGDQWAKEDE AVFKARHPSV TVNMSSTQKI KTEEQPKMAT QPSDVVMNSG
ADSMDISTLV NEGAIEPLED LLAAPAWDSE GTVADTLLPG TVNDGTFQGK FYVVNIAYTV
WGNWYNAALF DKEGWQPPKT FDEFFALAPK IKAKGMAPYV YDAVHGYYPR WALMATIWKS
AGKQAVIDID NLKENAWKAD GVLPALEAWE KLVKDKLLLP GQLDHTQSQQ AWLDGKAAFI
QVGTWLKNEM AETIPPGFEM TLSDYWSLGA GDKAPNDVYA GAGENIVVPS KAPNKAAAKE
FLRAVLSKEG SAKFAELTKS LASTKGSGDN VEDSALASAN ELMRNAPQDL VSFKFWNFYA
DLDKESQNLS AELMAGRMTA QQFVDGMQGA ADKVAKDSSI KKQTRSA