Gene Sare_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2001 
Symbol 
ID5704366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2298780 
End bp2300084 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content69% 
IMG OID641271497 
Productmajor facilitator transporter 
Protein accessionYP_001536868 
Protein GI159037615 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.548346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00155149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGTCGC TGCTGCTCGT GGTGTCCAGT GGGCGCATGA CACCCGCGAC CGTGGTGCGG 
CGATTCCCGT ATTCATCGCC GTACTGGCCT GTGCTGAGCA ATTCGTTGCT GCGTCGGATC
CTGCCCGGCC TGACGGTGTC CGCCCTGGGC GATGGCATGG CGGTCGTGGC GGTGAGCTGG
TTGGCGTTGC AATTGGCGTC GCCGGGGCAG CGCGGCCTGT GGGTGGCGAT CGCGGTAGCG
GCTTACACCG TCCCCAGCGT GGTCGGCACG CTGGTGTTCG GCCGGGTGTT GGGCGGGCGG
AACGGTGCGC AGTTGGCCGG GTGGGACGCC ACCCTGCGCG CGGGGACGCT TGCGGCGATT
CCGGTCGCCT ACCTCTTTGG GGCGTTGAGC CTCGGGTTGT ACGTGACCTT GTTGGCCGTC
TCGTCGCTGT TGCACTCGTG GGGTTCGGCG GGACGCTACA CGCTGATCGC CGAGGTGCTC
CCGGTACGCG ACCACCTGGC GGGTAACGCC GTCCTCGGCA TCATCGCCGA GATGGCCACC
ATCGGTGGGC CTCCGCTGGC CGGACTCCTG ATCAGCTGGG GCGGAGCAAT CTGGGTGATC
GCCATCGACG CAGCCACCTT CGCCGTCCTC GCGCTCACTT ACCGGCTGGC TGTACCCGCC
GCCGACAGAC CGGCACCGGC GCAAACTGGC GCTTCCCGCA CCGCCGGCTT CGGCGTCATC
CGCCGCAACC GCAGCCTGCT CGGCCTGCTT ACCCTGAGCT TCGGGTTCTT CTTCCTCTTC
GGCCCCGTCT ACGTCGCCCT TCCCCTCTAC ATCACAGACG AACTGCACGC CTCGGCGACC
CTGCTCGGCA CTTATTACAT GGCATTCGGT GCGGGTGCCC TCGTCGGCGG CTTGACCGTG
GGCTACCTAC GCCGCAGGCC GCTGTGGGTC GTCACCATCG GCATCGTCGT GGGCTTCGGT
CTCACCATGC TGCCCCTCGG GCTGGGCGCA CCCGTCAGTT TGTCCCTGCT GTCCTTTGCC
ATCGGCGGGG CGGTGTGGGC GCCGTACATG CCCACGTCGA TGGCGTTGTT CCAACGCAGT
ACCACGGCCG CGAACCGCCC GCAGGTCCTC GCCGCCAACG GCGCCGTCAC CGTGGTGGCG
GTACCGGCGG GCACCATGCT CGGCGGCCCG CTCGTGAGTG CCCTCGGCGC CCACGAGACG
TTGCTGTTCT GCGCTATCGC CATCATCGCC TTCGGAGTGA TCGCCACCGG CTTGACTGTC
CTCCATCGCC TCGCGCCTCC CGTCGGCGAC ACCGAGAGGG AGTGA
 
Protein sequence
MLSLLLVVSS GRMTPATVVR RFPYSSPYWP VLSNSLLRRI LPGLTVSALG DGMAVVAVSW 
LALQLASPGQ RGLWVAIAVA AYTVPSVVGT LVFGRVLGGR NGAQLAGWDA TLRAGTLAAI
PVAYLFGALS LGLYVTLLAV SSLLHSWGSA GRYTLIAEVL PVRDHLAGNA VLGIIAEMAT
IGGPPLAGLL ISWGGAIWVI AIDAATFAVL ALTYRLAVPA ADRPAPAQTG ASRTAGFGVI
RRNRSLLGLL TLSFGFFFLF GPVYVALPLY ITDELHASAT LLGTYYMAFG AGALVGGLTV
GYLRRRPLWV VTIGIVVGFG LTMLPLGLGA PVSLSLLSFA IGGAVWAPYM PTSMALFQRS
TTAANRPQVL AANGAVTVVA VPAGTMLGGP LVSALGAHET LLFCAIAIIA FGVIATGLTV
LHRLAPPVGD TERE