Gene Sare_2171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2171 
Symbol 
ID5704955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2495177 
End bp2496919 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content70% 
IMG OID641271654 
Productextracellular solute-binding protein 
Protein accessionYP_001537025 
Protein GI159037772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.596914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0132313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACCGGC CTGAACCACG GAAGGTGACA GACCCCGTGT CGACACGTCG ACCGCCTCCT 
CGGCCCGGCG GTACGCTGCG CTACTACGGG CCCGGTGGGC TTGACCACCT GGACCCCGCC
GCCGCCTACT ACGCCTTCTC GCACCAGATC ACCCGACTCT TCGCCCGGCA ACTGTTCGGT
TATCCGACTA CCGAGGACGC CCGTGCGCTG GTGCCGGTGC CGGATGTTGC CGCCGAGGTG
CCGACCAGGA ACAACGGCGG GCTCAGCCGA GACCGTCGGA CGTACACCAT CTGGTTGCGC
GAGAGCGTCC GCTGGGACAC CGTCCCGCCT CGGTTGGTGA CCGCCGAGGA CTTCGTACGC
GGTGTCAAGC GGATGGCCAA CCCGGTCGCG GGGGCGGGTG CCATCGCCTA CTACACCAGC
ACGATCGTCG GTATGGCCGA GTTCGCCGAG GGGTACCGGG CCTGCTTCGC CGGCCGTACG
CCCACCGCTC GGGACCTGGC CACGTACCAG AACGACCATG ACATTCGGGG CCTGTGGGCT
GTCGACGACC GCACGTTGGT GATCGAGCTG CTGCGGCCGG CCAACGACCT GCTCAACCTG
CTGGCGATGC CGTTCGCCTC GGCGGCCCCC CGCGAATACG ACGATCTCGT CCCGGACGGC
CGGGACTTCG CCCGGCTGGT CCGTTCCAAC GGCCCGTACC GGATCACCAG CTACGTGCCC
GGCAGCCATC TCACGATGGC GCACAACCCC GCCTGGCAGG CCGAGACCGA CCCGATCCGG
CGTCGGTACG TGGACCGGAT CGATGTCCGC ATGGCGCGGG TGAGTGACGA GCGGGTCCGT
TCCGAAATCA CCAGCGGTCG GGCGGATCTG TCCTGGGGGG CCGCGGTGGG ACGGCCGCGC
CGACGCACCG CGGCCGACCG TGACCTGGGG TGGGCGTTGA ACCCCTACCT TGTGTTCAAC
CTGCGCAGTC CGCACGAGCA GGGCGCGTTG CGGGACCGTC GCGTCCGGTT GGCCATCGCC
TACGCCGTCG ACAAGGCCCG CCTGGTGCGG TACTTCGACG AGATGAACAT CGGTACGCGC
ACCCGGCCGG CCCGCACGGT GATCCCGCCG GGCAACGTCG GGCACCGCGG GTACGACCCC
TACCCCACGC CGGGCGACCG AGGCGACCGA GGGCGCTGCC GGGAACTGCT GGCCGAGGCG
GGCCACCCGG GCGGGCTGAC CCTGACCATG ATCTATCGGA TCGACGCGGT GCACGGGCGG
GTGGCCAAGG CGATCGCCGA GGACCTGGCC GCGGGAGGCG TCGATGTGCG GTTGGTGGAA
GTCGATCGAA CCGACGAGTA CTACCGCATC CTCCAGGACC CGAGACGCGC GGCAGCGGGG
GAGTGGGATC TGACTCCGGC GGCGTTCATG CCGGACTGGT TCGGCAACAA CGGCCGGTCG
TACGTCCAGC CGATGTTCCA GTCCCACTCG GCGGTCGGCA CCGCGAACTA CGGCGGTTAT
CACAGTCCGG TGGTGGACGA GCTGATCGAT CGGGCCCTCG CCGCCCAGAC GGAGGCGCGG
GCCGCGGAGC TGTGGCACCA GGTCGACCGG CAGGTGCTCG CCGACGTGGC GGTCGTGCCC
ATCCTGGTCT GCGAGCCGAC CATCGAGCAC CTGACCAGTG ACCGGGTGCG CAATGCCATC
CCGCTGCCGC ACGTGGACCG CTGGTACGAC GCGTCGAACC TGTGGCTCGA CGCGGCCGAC
TGA
 
Protein sequence
MYRPEPRKVT DPVSTRRPPP RPGGTLRYYG PGGLDHLDPA AAYYAFSHQI TRLFARQLFG 
YPTTEDARAL VPVPDVAAEV PTRNNGGLSR DRRTYTIWLR ESVRWDTVPP RLVTAEDFVR
GVKRMANPVA GAGAIAYYTS TIVGMAEFAE GYRACFAGRT PTARDLATYQ NDHDIRGLWA
VDDRTLVIEL LRPANDLLNL LAMPFASAAP REYDDLVPDG RDFARLVRSN GPYRITSYVP
GSHLTMAHNP AWQAETDPIR RRYVDRIDVR MARVSDERVR SEITSGRADL SWGAAVGRPR
RRTAADRDLG WALNPYLVFN LRSPHEQGAL RDRRVRLAIA YAVDKARLVR YFDEMNIGTR
TRPARTVIPP GNVGHRGYDP YPTPGDRGDR GRCRELLAEA GHPGGLTLTM IYRIDAVHGR
VAKAIAEDLA AGGVDVRLVE VDRTDEYYRI LQDPRRAAAG EWDLTPAAFM PDWFGNNGRS
YVQPMFQSHS AVGTANYGGY HSPVVDELID RALAAQTEAR AAELWHQVDR QVLADVAVVP
ILVCEPTIEH LTSDRVRNAI PLPHVDRWYD ASNLWLDAAD