Gene Sare_4889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4889 
Symbol 
ID5707541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5544594 
End bp5545943 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content72% 
IMG OID641274284 
Productmajor facilitator transporter 
Protein accessionYP_001539629 
Protein GI159040376 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.295208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC GCACCGTGCC CACTGTCCGG TGGGCCGCGA TCTGGCTCGG CCAGCTCGTC 
TCGCTGGTCG GGTCGAGCCT CACCGCGTTC GTCCTCGGGG TCTGGGTCTA TCAGCGCACC
GGCTCGGTCA CCCAGTTCTC CCTGATCTTC CTTGCCGCGA CCCTGCCGGC GGTACTGTTC
GCGCCGTTCG CCGGGGCGCT TGCCGACCGC CGGGACCGCC GGGGTCTGAT GCTGGTCGCC
GACACCGTCG CCGCCGCCGG CACGGCGGCG CTCGCCGCGC TGGTCGTCGC CGACGCCCTC
CAGGTCTGGC ACATCTACCT GGGGACCGCG GTGACCGCGA CCGCGTCCAC CGTGCATCAG
GTCGCCTACC AGGCGATGAC CCCGGCCCTG GTCGGCAAGC GACATCTGGG CCGGTTCAAC
GGCCTGATGC AGGTCTCCCG CGCGGTTCAG ATCGCCGCAC CACTGATCGC CGGGGTGCTC
GTGGTGACCG TCGGGATCGG CGGGGTCATG GCGATCGATC TGGGTACCTT CGTGGTCGCG
GCGTCGACCC TGCTGCTGGT CCGGCTGCCT GCCGACGTGA CACGCCCGGC CGAATCCGGT
CCCGCCGAGG CGGTGCTGCG GGGAGCCGCC GCCGGCTGGC GCTATCTGCG GCAACGGCCG
GGCCTGCTCC AGCTCATGGT GGTCTTCGGT GCGTACAACT TCCTCTTCGG CATCGCCGGG
GTCCTGGTGC AGCCGCTGAT CCTCTCGTTC GCCACGGCGG ACACCCTCGG CCTGCTGATG
TCCGTCGGGG GTGCCGGCCT CTTCGCCGGC AGCCTGGTGA TGGGGGTGTG GGGCGGGCCG
ACCCGTCGGG TCACCGCCGT CTGCGGTGGA CTCGCGGTCG GGGGCGTGGC TCTCGTCCTG
CACGCGGCGG CCCCGTCCGC CTGGCTGATC GGGGTGGTGG CCCCGCTGTT CCTCTTCACC
CTGCCGATCG TGAACAGCTC CACCATGACC CTGATCCAGA CCAAGACCGA ACCCTCCGTG
CTGGGCCGGG TGCTCGCCAC CGCCCGGGTG ATCGGCGACG CCAGCGTGCC CCTGGCGTAC
GTGTTGGCCG GGCCGATCGC CGATGGTCTC TTCGAGCCGA TGCTGCGCCC GGAGGGTGCG
CTCGCTGATT CGGTGGGCCG GGTGATCGGC ACGGGGGAGG GCCGCGGCAT CGCGCTGCTC
TTCGCGGTCA CCGGGGTGGC GATGGTGTTC CTCGCTGTGC TCGCCTGGAC CCGGCCGGTG
CTGCGCGGCG CGGATGATCT ACCCGACGCC CTTCCCGACG ACGCCCCGGA CCCCGAGACC
GTCTCTGCCG ACCGTCAACC CGCCACGTGA
 
Protein sequence
MAERTVPTVR WAAIWLGQLV SLVGSSLTAF VLGVWVYQRT GSVTQFSLIF LAATLPAVLF 
APFAGALADR RDRRGLMLVA DTVAAAGTAA LAALVVADAL QVWHIYLGTA VTATASTVHQ
VAYQAMTPAL VGKRHLGRFN GLMQVSRAVQ IAAPLIAGVL VVTVGIGGVM AIDLGTFVVA
ASTLLLVRLP ADVTRPAESG PAEAVLRGAA AGWRYLRQRP GLLQLMVVFG AYNFLFGIAG
VLVQPLILSF ATADTLGLLM SVGGAGLFAG SLVMGVWGGP TRRVTAVCGG LAVGGVALVL
HAAAPSAWLI GVVAPLFLFT LPIVNSSTMT LIQTKTEPSV LGRVLATARV IGDASVPLAY
VLAGPIADGL FEPMLRPEGA LADSVGRVIG TGEGRGIALL FAVTGVAMVF LAVLAWTRPV
LRGADDLPDA LPDDAPDPET VSADRQPAT