Gene Strop_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1643 
Symbol 
ID5058102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1872285 
End bp1873256 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content64% 
IMG OID640473916 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001158486 
Protein GI145594189 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.010471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCCA TCGTTAACAA GAGGGTCCTG GCGGGCGTCT CGCTGTCCAC GGTGGCGGCT 
CTCGCGCTCA CCGCGTGCGG TGGGACCAAG ATCGAGTCGA CGGACCCCGC CGAAGCGGGC
GACTGCGGCA CCTTCACCAT CGCGATCAAC CCCTGGGTGG GGTACGAGGC GAACGCGGCC
GTCATCGCCC ACGTCGCCGA GACCGAACTC GGCTGCAAGG TCGTCAAGAA GGATCTCAAG
GAGGAGATCG CCTGGCAGGG CTTCGGCACC GGTCAGGTGG ACGCGATCGT GGAGAACTGG
GGCCACGACG ACCTCAAGAA GAAGTACATC GAGGATCAGA AGACCGCGGT GAACGCCGGT
TCGACCGGTG TCGAGGGTGT CATCGGCTGG TACGTGCCGC CATGGATGGC CGAGGAGTAC
CCCGACATCA CCGACTGGAA CAACCTGAAC AAGTACGCCT CCCTCTTCGA GACCACGGAG
TCCGGCGGCA AGGGACAGCT GCTCGACGGT GACCCGTCCT TCGTCACCAA CGACGAAGCC
CTGGTCAAGA ACCTGGGGCT GGACTACCAG GTGGTGTACG CGGGCAGCGA GCCGGCCCTG
ATCCAGGCGT TCCGTCAGGC GGAGCAGGAG AAGAAGCCGG TGCTCGGCTA CTTCTACGAC
CCGCAGTGGT TCCTCTCCGA GATCGAACTG GTCAAGGTGA ACCTGCCCGA GTACGAGGAG
GGCTGCGACG CCGACCCGGA GAAGGTCGCC TGCGACTACC CGGTGTACGA CCTTGACAAG
ATCGTGAGTA AGTCGTTCGC CGACGCCAAC GGGCCCGCCT ACCAGCTGGT CGACAACTTC
AACTGGAGCA ACGAGGACCA GAACGTGGTG GCCCGGTACA TCGCCCAGGA CAACATGTCG
CCGGAGGAGG CGGCTGAGAA GTGGGTCGAG GCCAACAAGG ACAAGGTTGA GGCCTGGCTG
CCGCAGAGCT GA
 
Protein sequence
MRSIVNKRVL AGVSLSTVAA LALTACGGTK IESTDPAEAG DCGTFTIAIN PWVGYEANAA 
VIAHVAETEL GCKVVKKDLK EEIAWQGFGT GQVDAIVENW GHDDLKKKYI EDQKTAVNAG
STGVEGVIGW YVPPWMAEEY PDITDWNNLN KYASLFETTE SGGKGQLLDG DPSFVTNDEA
LVKNLGLDYQ VVYAGSEPAL IQAFRQAEQE KKPVLGYFYD PQWFLSEIEL VKVNLPEYEE
GCDADPEKVA CDYPVYDLDK IVSKSFADAN GPAYQLVDNF NWSNEDQNVV ARYIAQDNMS
PEEAAEKWVE ANKDKVEAWL PQS