Gene Strop_2006 details


Gene Information

Locus tag: Strop_2006
Symbol:
ID: 5058469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2276570
End bp: 2277607
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 640474272
Product: extracellular solute-binding protein
Protein accession: YP_001158838
Protein GI: 145594541
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID: [TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein
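
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2276570
end_bp = 2277607

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.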


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.569092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.043558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGCA CCCCCCTGGC TCTCGCCACC CTCGCGGTGG CCTCGCTCGC GCTCGCCGCG 
TGCGGATCAG GCACCTCCGA CCCGGGTGGG GCTGACGGCG ACAAGACCGT CACCGTCTAC
TCCGCCGACG GGCTCGGCGA CTGGTACAGC AAGCAGTTCG TCGAGTTCGA GAAGCAGACC
GGCATCAAGG TACAGATGAT CGAGGCCGGC TCCGGTGAGG TCGTCTCTCG GCTACAGAAG
GAGAAGGCGA ACGTCCAGGC GGACCTGGTC GTCACGCTGC CGCCCTACAT CCAGAAGGCC
GACGCCGACG GGCTGCTACA GCCTTACACG CCGGCCGGCG CCGACCAGGT GACCGGTGCG
ACCGACACCT ACGTGCCGTT GGTGAACAAC TACCTCTGCT TCATCTATAA CCCGGACAAG
GTCGACGCCG CCCCGACGAC GTTCGACGAT CTGCTCAGCC CCGTGTTCGC CAAGAAGCTT
CAGTACTCGA CGCCCGGCCA GGCGGGTGAC GGCACCGCCG TGCTGCTGCA CCTGCAGCAC
ATCCTCGGCA AGGACAAGGC ACTGGAGTTC CTGGCGAAGC TCGAAACGAA CAACGTCGGC
CCGTCGTCGT CCACCGGCAA GTTGCAGCCC AAGGTCAGCA AGGGCGAGAT CTACGTGGCC
AACGGCGACG TGCAAATGAA CCTCGCGTCG ATCAACAACG ACAGGTCCAA CTTCAAGATC
TTCTTCCCGG CCGGTCCGGA CGGCAGGGCA TCCACCTTCT CCATCCCGTA CACCATGGGC
CTGGCCGCCG GCGCCCCACA TGCCGACGCC GGTCGCGAGC TGGCCGACTT CCTACTCTCC
ACCACTGCCC AGGAGCAGGT GTCCCAGCAG GCGTACGGCG TCCCGGCACG TGCCGACGTC
AAGCCCGCTG ACAAGCAGTT CCAGCAGGTC GAGCAGGCGC TGCAGGGCGT GGAGATCTGG
CCGGCCGACT GGGCGAAGAT CCTGACCGAG ATGGACGCGG ACATCGCGGC CTACAACGAG
GCCCTCGGCC TGGCATAA
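
The GC content field is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is given in the space-separated 10-base blocks used here. The function name `gc_content` is hypothetical, and the example input is only the final block of the gene, so its result is not the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final block of the gene sequence, copied from the record.
last_block = "GCCCTCGGCC TGGCATAA"
print(f"GC fraction of the final block: {gc_content(last_block):.2f}")
```

Passing the complete gene sequence to the same function gives the whole-gene value to compare with the GC content field.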
 
Protein sequence
MRRTPLALAT LAVASLALAA CGSGTSDPGG ADGDKTVTVY SADGLGDWYS KQFVEFEKQT 
GIKVQMIEAG SGEVVSRLQK EKANVQADLV VTLPPYIQKA DADGLLQPYT PAGADQVTGA
TDTYVPLVNN YLCFIYNPDK VDAAPTTFDD LLSPVFAKKL QYSTPGQAGD GTAVLLHLQH
ILGKDKALEF LAKLETNNVG PSSSTGKLQP KVSKGEIYVA NGDVQMNLAS INNDRSNFKI
FFPAGPDGRA STFSIPYTMG LAAGAPHADA GRELADFLLS TTAQEQVSQQ AYGVPARADV
KPADKQFQQV EQALQGVEIW PADWAKILTE MDADIAAYNE ALGLA
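
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last blocks of the gene are used as input.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acids
# of the standard code (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence, copied from the record.
first_block = "ATGCGACGCA CCCCCCTGGC TCTCGCCACC CTCGCGGTGG CCTCGCTCGC GCTCGCCGCG"
last_block = "GCCCTCGGCC TGGCATAA"

print(translate(first_block))
print(translate(last_block))
```

The two results match the first 20 and last 5 residues of the protein sequence. The last block can be translated on its own only because the preceding 1020 bases are a multiple of three, so it starts in frame.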