Gene Strop_1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1641 
Symbol 
ID5058100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1869132 
End bp1870211 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content69% 
IMG OID640473914 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001158484 
Protein GI145594187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.565907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.509046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG ATACGGCGGT TCAGCCGCGC GGGCAGGTCA ATGATCGCCA GACTCCGGTG 
ATCTCGGTCC GGAACCTGTG GAAGGTGTTC GGCCCGAACG CGGAGCGGGT GCCGAGCTCG
ACCGAGCTCG CGGGGCTGTC CCGGCGGGAA CTGCGGGAGC GGGCCCGGTG TACCGCCGCG
GTACGGGAGG TGTCGTTCGA CGTCGCGCCG GGAGAGGTCT TCGTCGTGAT GGGGCTGTCC
GGCTCCGGCA AGTCCACGCT GGTGCGCTGC CTGACCCGGC TGATCGAGCC CACCGCGGGG
GAGGTGGTCT TCGAGGGCGA GGACATCCTG CGTGCCGACA AGAAGCGGCT GCGGGAGCTC
CGTCGCCGTA AGTTCTCGAT GGTCTTCCAG CACTTTGGTC TCCTGCCGTA CCGAACGGTC
GTCGACAACG TCGGGTATGG ACTGGAGATC CGGGGTGCCG GCCGCGCCGA GCGGATCCGC
CGGGCGACCG AGGTCATCGA GCTGGTGGGC CTCGACGGCT ACGAGCAGGC GTACCCGGAC
CAGCTCTCCG GCGGGATGCA GCAGCGGGTC GGGCTGGCGC GGGCGCTGGC CGGTGACCCG
GACGTGCTCT TCTTCGACGA GCCGTTCTCC GCGCTGGACC CGCTGATCCG CCGTGACATG
CAGAACGAGG TCATCCGACT GCACCGTCAG GTTGGTAAGA CGATGGTCTT CATCACGCAC
GACCTCTCCG AGGCGCTCAA ACTCGGCGAC CGGATCCTGC TCATGCGCGA CGGCAACGTG
GTGCAGGCCG GGACCGGGGA CGAGTTGGTC GGGGCGCCGG CCGACGACTA CGTGCGCGAC
TTCGTCCAGG ACGTGCCCCG CGCCGACGTT CTCACCCTGC GGTGGATCAT GCGTCCATCC
CGGGACGCCG ACCAGCTGGA CGGTCCTCAG CTGGGGCCGG GTGTCATCGT GCGCGACGCG
GTCCGCACGG TGCTCGCCGC CGACCGGCCG GTGCGGGTCG TCGAGAACGG GGAGCTGCTG
GGCGTGGTCG GCCATGAGGA AGTCCTCAAT ATTGTCGCCG GCACGCAGGC GGGCGCCTAA
 
Protein sequence
MSTDTAVQPR GQVNDRQTPV ISVRNLWKVF GPNAERVPSS TELAGLSRRE LRERARCTAA 
VREVSFDVAP GEVFVVMGLS GSGKSTLVRC LTRLIEPTAG EVVFEGEDIL RADKKRLREL
RRRKFSMVFQ HFGLLPYRTV VDNVGYGLEI RGAGRAERIR RATEVIELVG LDGYEQAYPD
QLSGGMQQRV GLARALAGDP DVLFFDEPFS ALDPLIRRDM QNEVIRLHRQ VGKTMVFITH
DLSEALKLGD RILLMRDGNV VQAGTGDELV GAPADDYVRD FVQDVPRADV LTLRWIMRPS
RDADQLDGPQ LGPGVIVRDA VRTVLAADRP VRVVENGELL GVVGHEEVLN IVAGTQAGA