Gene Sare_4528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4528 
Symbol 
ID5706018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5120395 
End bp5121639 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content73% 
IMG OID641273942 
Productmajor facilitator transporter 
Protein accessionYP_001539291 
Protein GI159040038 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2807] Cyanate permease 
TIGRFAM ID[TIGR00896] cyanate transporter 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00228115 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCACGG CCCTCGGCGG GAAGCGTGGA GCAGCGCGTC GAGGCGCGCT CGTACTGGTC 
GGCATGGTCC TGGTGGGTGC GAACCTGCGG GTGGCCGTGA CCAGCCTCGG TGCGCTGCTC
GACGAGGTGC GTACCGGGCT CGGACTGTCC GGAACCATGG CCGGCGTGGT CACCACGCTG
CCGACGGTCG CGTTCGCCGG GCTCGGGGCG ATCACCCCGT GGCTGGTCCG TCGGGCGGCA
CCGGCACGGT TGCTGGTGGT GGCCATGCTC GCGCTCACCG CCGGCCAGGT GCTCCGGGCG
GCCACCGGCT CCGCGCTGGC CTTCGTCATC ACCAGCGCGC TGGCACTGGC CGGGATCGCG
GTGGCGAACA TCCTCCTGCC CCTGCTGGTC AAGCAGTACT TTCCACACCG CACCGGACTG
GCCACCGGGG CGTACACGAT GGCCATGACC GTGGGCACGA CGGTGGCCGC CGCCGCGGCG
GTGCCGACCG CGCACGCCTT CGGCTCGTGG CGGGCCGGGC TCGGCGTCTG GGCTGGGCTG
GCCGCGGTGG CCGTACTTCC GTGGGTGGTG CTGGCCCGTC GGGCCCGGAC CGAAGCGGGG
CGGTCGAGCC CGACGGAGAC CGTCACCCGG ACCCGGGCGC GCCCGGCGCG AACCCGGCTC
GGCTGGGCCA TGGCCGTGTA CTTCGGCACG CAGTCACTCA GCGGGTACGC GATCATGGGC
TGGTTGGCGC AGCTGTTCCG CGACTCCGGG TACCAGCCGG CGACGGCGGG TCTCCTGCTC
GCCGGGGTGA CGGCGCTGGG CGTGCCGGTC GCGCTCGTGA TGCCGATGCT GGCCGGCCGG
CTGCGGACGC TGCGACCGCT GGTGTTGTCG TTGGCCGCCG CGTCGGCGCT GTCCTACCTG
GGGCTGGCAC TGGCACCGTA CGACGGCGCG TTGCTCTGGG TGGCTCTGCT GGCCCTGGGT
CAGGGTGCCT TTCCGCTGGT CCTTGCGACC ATTGGGCTAC GGGCACGCAC GGCGGAGGGG
ACAGTGGCAC TGTCGGCGTT CGCGCAGAGC ACCGGCTATC TCATCGCAGC GCTGGGGCCG
CTGTTCGTGG GCGTCCTCTA TGGGGCCACC GGGGGGTGGA CCGTTCCGGT CGGCTTTCTG
CTGTCGGCGC TCGCGGTGCA GACCTGCGTG GGCCTGGTCA TCGCCCGCCC ACGCTACATC
GAGGACGAGG ATGGGTTCGT GGCCGGCCGG CACGCCGGTC GCTGA
 
Protein sequence
MATALGGKRG AARRGALVLV GMVLVGANLR VAVTSLGALL DEVRTGLGLS GTMAGVVTTL 
PTVAFAGLGA ITPWLVRRAA PARLLVVAML ALTAGQVLRA ATGSALAFVI TSALALAGIA
VANILLPLLV KQYFPHRTGL ATGAYTMAMT VGTTVAAAAA VPTAHAFGSW RAGLGVWAGL
AAVAVLPWVV LARRARTEAG RSSPTETVTR TRARPARTRL GWAMAVYFGT QSLSGYAIMG
WLAQLFRDSG YQPATAGLLL AGVTALGVPV ALVMPMLAGR LRTLRPLVLS LAAASALSYL
GLALAPYDGA LLWVALLALG QGAFPLVLAT IGLRARTAEG TVALSAFAQS TGYLIAALGP
LFVGVLYGAT GGWTVPVGFL LSALAVQTCV GLVIARPRYI EDEDGFVAGR HAGR