Gene Ksed_17640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_17640 
Symbol 
ID8373269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1837193 
End bp1838545 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content71% 
IMG OID644992029 
Productarabinose efflux permease family protein 
Protein accessionYP_003149541 
Protein GI256825581 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.00798573 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.929422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCAAC GGGCAAGCAG TGCAGGGACA TGGGTCGTCG TCGCCGGCTA CTTCCTCGTG 
ATGCTGGACA CCACGATCGT GAATATCGCC CTGCCCCACC TCGGCACCGG TCTGTCAGTC
TCACCGGGTG GCCTGGCGTG GATCATGGAC GCGTACACGC TCGTGTTCGC CGCCCTGCTG
CTGCCCGCCG GCAGCGCCTG CGACCAGTAC GGGGCGCGCC GCGTCTACCT GACCGGCATC
GCCGTGTTCG CGCTCGCCTC GATCGCGTGC GCCCTGGCCC CGAACGCTGG CCTGCTCATC
GCGTCCCGCG CGATCCAAGG TATCGGGGCG GCCGCGGTCG TACCGGCCAC CCTCGCCCTG
ATCACCGAGC TCTTCACCGA CCCCGCCGCA CGAGCCACAG CGGTCGGACT GTGGGGCGCG
GCCGGCGGCG TGGCCGCGGC GGTCGGGCCG CTGCTCGGCG GCGTGCTCCT CGACGGAATC
GGCTGGCGCG CCGCTTTCTG GGTCAACGTG CCCGTCGTCG TCGCCATAGC GATTGGCGCC
CTCCGGTCTC TGCCCGCCCG TACCGCGAGA CCGGGCCGGC TCGACGCGGC CGGTCAGATG
CTGGCGATCC TGGCGCTGGC CGGGTTGACG TTCGCGATCA TCGACACCGG CGACCACGGC
CTCACCGCCC GTGCGGCCGC CGGGTTCGCT GTCGCGGTCC TGGCTGCAGT CGGGTTCGTG
TGGCACGAGC GCCGCAGCCG GACGCCGATG CTGCCACTGT CGATCTTCTC CGCGCCCGGA
TTCTCCACGG CGACGGTGGT CGGGTTCGTG CTGAACTTCA GCTTCTTCGG GCAACTGCTC
GCACTCACCC TGTACATCCA GGACACCCGT GGCCTCGCAC CCGCGATCGC GGGGCTCGTC
ATGGCCCCGC AAGCACTCGG CGCGATCATC GGCGCCCCGC TCGGCGGCCG CATCACCGCC
GCACACACCC CACAGCGGGC GATGCTCACC GGCCTCGCGA TCGGCACGGC AGGATTCGCG
AGTCTGATGA TCTTCGACAC CTCTACCCCT TACCCGGTGG TGGCGATCCT GACATTCGTC
GCAGGGCTGG GAATGGCGAT CGCGATGCCC GCGGCGACCA GCGCCGCGGT CTCCGCCGCC
CCGGACACCC TGACGGGGAT CGCCGGAAGC GTGATCAACG CCGCCAGACA GACCGGCAGC
GTCGTCGGCG TCGCCGTGCT CGGCAGCCTC GCAACCGGGT TCGGCAACAT CACCGGCTTC
CGAGCCGCAG CCCTCGGAGC GGCGATCGCC TTCGCTCTCG GCCTCGCTCT CGTCCTCTGG
AACGCCGTGA ACAAGCAGTC GCTCTATTCC TGA
 
Protein sequence
MGQRASSAGT WVVVAGYFLV MLDTTIVNIA LPHLGTGLSV SPGGLAWIMD AYTLVFAALL 
LPAGSACDQY GARRVYLTGI AVFALASIAC ALAPNAGLLI ASRAIQGIGA AAVVPATLAL
ITELFTDPAA RATAVGLWGA AGGVAAAVGP LLGGVLLDGI GWRAAFWVNV PVVVAIAIGA
LRSLPARTAR PGRLDAAGQM LAILALAGLT FAIIDTGDHG LTARAAAGFA VAVLAAVGFV
WHERRSRTPM LPLSIFSAPG FSTATVVGFV LNFSFFGQLL ALTLYIQDTR GLAPAIAGLV
MAPQALGAII GAPLGGRITA AHTPQRAMLT GLAIGTAGFA SLMIFDTSTP YPVVAILTFV
AGLGMAIAMP AATSAAVSAA PDTLTGIAGS VINAARQTGS VVGVAVLGSL ATGFGNITGF
RAAALGAAIA FALGLALVLW NAVNKQSLYS