Gene Sare_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3000 
Symbol 
ID5707610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3408172 
End bp3409128 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content65% 
IMG OID641272447 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001537815 
Protein GI159038562 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.512317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000353502 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGGAT CGCTCGGAAC GGCGACCACA GTTCGGCCGA CGGGGTCGGA TAGCGACCTC 
GGTGAGCGCA GTGCGGCACA ACAGCGCACC AGAACCCCGT ACGAATGGCC GGCGCGGGCG
TTCAGCCTGC CCGCGGTGCT TCTCGTGGCG CTGCTGCTGT ACCTGCCGTT CGTGTGGACC
ACGTACATCA GCTTCACCGA CTACAACGGC TTAGGCAGCC CGGTGTGGGT GGGTCTGAGC
AACTACCGGG ACATGGTCTC CGACTCCGTG TTCCTCACGG CGGTCGGCAA CACGCTGCTC
TGGGTCGTAG GCAGCATCTC CCTTCCCGTG GCGTTGGGTC TGCTCATAGC GGTGCTGACC
TACGGACGTC GCTTTGGCAC CCTGATTCGG CTGCCGTTTC TCATCCCGTA CGCCGTGTCG
GGCGTCGCCG TCGGCGTCAT CTGGGGATTC GTGCTGCAGA CCCAAGGCGC CCTGGGTCAG
GTGCTGGAGT TCCTCAACCT TCCCGGTGCC AGCACCAGGT GGCTCCTTGA GGGCCCACTG
AACACGTTCG TCATGATCGG CGCGGCGTCC TGGCAGATCG TCGGTGTCAA CGCCCTGCTG
TTCGTGATCG GGTTGCAGTC CATTCCCAGG GAACCCGTCG AGGCCGCTCG GCTCGACGGT
GCCACCGGCT GGACAATGTT CCGGTGCATC GTCTGGCCGC AGATGCGTGC GCTCACCACC
GTCGTGATCG GGCTGTCGAT CGTGGCGAGC CTCAAGACGT TCGACATCGT GTGGATCATG
ACTCAGGGTG GCCCGGGCCG TGTGTCGGAG ACCCTCGCCC TGACGATGTA CCGCGAGACC
TTCGTCCTCA GTGACTACGG CCAGGGTGCC GCCATCGCAG TCTTCCTCAG CGTTGTCACC
TTCGCCGCCT CGATCATCTA CCTGCGACGG CAGCTTTCCG ACCGAGCGAC GAGTTGA
 
Protein sequence
MSGSLGTATT VRPTGSDSDL GERSAAQQRT RTPYEWPARA FSLPAVLLVA LLLYLPFVWT 
TYISFTDYNG LGSPVWVGLS NYRDMVSDSV FLTAVGNTLL WVVGSISLPV ALGLLIAVLT
YGRRFGTLIR LPFLIPYAVS GVAVGVIWGF VLQTQGALGQ VLEFLNLPGA STRWLLEGPL
NTFVMIGAAS WQIVGVNALL FVIGLQSIPR EPVEAARLDG ATGWTMFRCI VWPQMRALTT
VVIGLSIVAS LKTFDIVWIM TQGGPGRVSE TLALTMYRET FVLSDYGQGA AIAVFLSVVT
FAASIIYLRR QLSDRATS