Gene Sare_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1661 
Symbol 
ID5703431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1912681 
End bp1914216 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content66% 
IMG OID641271165 
Productextracellular solute-binding protein 
Protein accessionYP_001536540 
Protein GI159037287 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.226847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCA TCAGATTTCG GGGGGCCGGC GTCGCGCTGG CTCTGACACT CGGTCTGCTG 
GCGGGATGTA GCGCGGGGGA GGGCGTCGAC GTCGACGGGT CGGGCCAGTC TGGTGCCGGC
GGTGTCCTCA CTGTCGCGAT CAGTGGGGAA CCGGATCAGC TGGACCCGCA TCGGACCTCG
GCCTACCACA GCTTCCAGGT GCTCGAGAAC GTCTACGACA CGCTCGTGGA GCCGGACGCG
AACCTGGCGA TGAAGCCGGC CCTGGCGACG GAGTGGAGCA CCAGCGAGGA CCAGTTGACC
TGGACGTTCA CCCTCCGTAA GGGGGTGACG TTCACCGACG GTTCGCCGCT TACCGCCGAG
GACGTGGTCT ACTCGTACAC CCGGATCATC GACGAGAAGT TGAATGCGGC GTACCGGTTT
TCCACGGTGG AGTCGGTGAC GGCCCCCGAC CCCGGTACCG TCGTCGTGAC GCTGACCGCG
CCCACCCCGA ACCTGCTCGC CAGCCTCGGC GGCTTCAAGG GAGTGGCGAT CGTCAAGAAG
TCCAACGTCG AGTCGGGCGC GGTGAAGACC GAGCCGATCG GTAGTGGTCC GTTCACTGTG
GCCTCCTACA CTGCCGGGGA CAGCATCAAG CTGGTGCGCA ATGACAGCTA CTGGGGCACC
AAGCCCAAGC TGGACGGGGT GACCTTCACC TTCGTCAAGG ACCCGACGGT GGCCCTGCAG
AACCTGCGCG GTGGTGAGGT GCAGTGGACC GACAACCTGC CCCCGCAGCA GGTGCCGGCG
CTTCGGGAGG ACGACGAGCT CGTCGTGCGT TCGGTGCCGT CGAGCGACTA CTGGTACCTG
GCCCTCAACC AGTCCCGTGA GCCCTACGAC AACGTCGAGG TACGCCGGGC GGTCGCCTTC
GCGCTCGACC GAGCGGCGAT CACCAAGGCC GCCAAGTTCG GGCTGGCGAC GGTCAACCAG
ACCGCCATCC CCGAGGACAG CGCCTTCTAC TACGACTACG CGCCGTACCA GCGGGACCCG
GCGCAGGCGA AGCAACTGCT GGCCGCGGCC GGCGTGACGG ATCTGACCAT GGACCTGATG
GTCACCAACG AGTACCCGGA GACAGTCACC GCAGCGCAGG TCATCGCCGC GCAGCTCAAG
GACGTCGGCA TCACTGTCAC GATCCGTACG TTGGATTTCG CCCAGTGGCT CGACGAGCAG
GGCAAGGGAA ACTTCGACTC GTTCATGCTC GGCTGGCTGG GCAACATCGA CCCCGACGAG
TTCTACTACG CCCAGCACCA CAGCCAGGGC ACCTTCAACT TCCACGGATA CCGCAACCCA
GCCGTGGACA GCCTGCTCGA CCAGGCCCGG ACCGAGACCG ACCAGGCCGC GCGTAAGCGG
CAGTACGAGC AGGTGGCGAA GCGGATCGTC GACGACGCCA GCTACCTCTA CCTCTACAAC
CCGGATGTGG TGCAGGGCTG GTCGCCGCAG GTCAGCGGCT ACCAGGTCCG TGCCGACCGG
GCGATTCGGT TCCGCGACGT CAGCCTCGAC CGGTGA
 
Protein sequence
MSSIRFRGAG VALALTLGLL AGCSAGEGVD VDGSGQSGAG GVLTVAISGE PDQLDPHRTS 
AYHSFQVLEN VYDTLVEPDA NLAMKPALAT EWSTSEDQLT WTFTLRKGVT FTDGSPLTAE
DVVYSYTRII DEKLNAAYRF STVESVTAPD PGTVVVTLTA PTPNLLASLG GFKGVAIVKK
SNVESGAVKT EPIGSGPFTV ASYTAGDSIK LVRNDSYWGT KPKLDGVTFT FVKDPTVALQ
NLRGGEVQWT DNLPPQQVPA LREDDELVVR SVPSSDYWYL ALNQSREPYD NVEVRRAVAF
ALDRAAITKA AKFGLATVNQ TAIPEDSAFY YDYAPYQRDP AQAKQLLAAA GVTDLTMDLM
VTNEYPETVT AAQVIAAQLK DVGITVTIRT LDFAQWLDEQ GKGNFDSFML GWLGNIDPDE
FYYAQHHSQG TFNFHGYRNP AVDSLLDQAR TETDQAARKR QYEQVAKRIV DDASYLYLYN
PDVVQGWSPQ VSGYQVRADR AIRFRDVSLD R