Gene Sare_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3814 
Symbol 
ID5705309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4346542 
End bp4347537 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content69% 
IMG OID641273236 
Productperiplasmic solute binding protein 
Protein accessionYP_001538598 
Protein GI159039345 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.906416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0575455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACC GCCGCGTTCC GCGTACCCTG GCCGCCGCCT CCGCCGCCCT ACTCACCCTC 
GGCGCCGCCG CCTGCTCCGA CAGCCAGGCA GACGCCGACC CGCAGCGAGT TGACGTGGTC
GCCGCGTTCT ACCCACTCCA GTTCCTGACC CAGCAGATCG GGGGCGATAC GGTAACCGTC
AGCAACCTGG TCAAGCCCGG CGCCGAGCCA CACGACATCG AGCTGAGCCC GAGCCAGGTT
GGTGACGTGG CAGGCGCGGA GCTGATCGTC TACCTCAAGG GCTTCCAACC GCAGGTCGAC
GACGCGGTGC AGCAGAACTC CGCCGACCGG GCGTTCGACG TGGCCACCGT CGAGCCGCTG
CTCGACGCCA CGGGCGACAA CCACAACCAC GACCACGGAC ACGAGGGCGA GGCCGGACAC
GAAGGTGAGG CCGGTCACGA AGGTGAAGGT GAAGCCGGAC ACGAGGGTGA GGCCGGCACC
AAGGATCCAC ACCTGTGGCT GGACCCGACC CGGCTCGCCA CCATCGGCGA CCAACTCGCC
GACCGGCTCG CGCAGGCCGA CCCCGAGCAC GCTGACGGGT ACACCGCCCG AGCGAAGGAT
CTCCGGACCA AGCTGGAGCA GCTCGACGCG GAGTTCACCG CCGGTCTGAA GACCTGCCAA
CGGCGGGAGA TCGTGGTCAG CCACACCGCC TTCGGCTACC TGACCACGCG CTACCAGCTG
GAGCAGATCG GCATCACCGG CCTGAGCCCG GAACACGAGC CGTCGCCGCA GCGGCTGGCC
GAGGTGATCG AGGAGGCCAA GGAGCACCAG GCCACCACGA TCTTCTTCGA GACGCTGGTC
AGCCCGAAGG TCGCCGAGAC CATCGCCGCC CAGGTCGGGG CCGAGACCGC GGTGCTCGAC
CCGCTCGAGG GGCTGTCCGC CGACAACGGC GGGGACTACT TCTCGGTGAT GCGGACCAAC
CTCGCCAACC TGCAAAAGGC TCTGGGCTGC TCATGA
 
Protein sequence
MNNRRVPRTL AAASAALLTL GAAACSDSQA DADPQRVDVV AAFYPLQFLT QQIGGDTVTV 
SNLVKPGAEP HDIELSPSQV GDVAGAELIV YLKGFQPQVD DAVQQNSADR AFDVATVEPL
LDATGDNHNH DHGHEGEAGH EGEAGHEGEG EAGHEGEAGT KDPHLWLDPT RLATIGDQLA
DRLAQADPEH ADGYTARAKD LRTKLEQLDA EFTAGLKTCQ RREIVVSHTA FGYLTTRYQL
EQIGITGLSP EHEPSPQRLA EVIEEAKEHQ ATTIFFETLV SPKVAETIAA QVGAETAVLD
PLEGLSADNG GDYFSVMRTN LANLQKALGC S