Gene Ksed_04170 details

Gene Information

Locus tag: Ksed_04170
Symbol:
ID: 8371927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 403939
End bp: 405438
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 70%
IMG OID: 644990713
Product: sodium/proline symporter
Protein accession: YP_003148257
Protein GI: 256824297
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
            [TIGR02121] sodium/proline symporter
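
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(403939, 405438)
print(length, protein_length_aa(length))  # prints: 1500 499
```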


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0302673
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACC CCACCTGGCA ACTGATCGCC ATCGTCCTGT ACTTCGCCGC CATGGTCGGC 
ATCGGCGTCT GGTCCTTCAC CCGCACCGGC GACAAGGGGG ACTACATGCT CGGAGGGCGC
GGCCTGGGCC CGGTGGCCTC CGCCCTGTCC GCCGGCGCCT CCGACATGTC CGGCTGGCTC
CTGATGGGGC TGCCCGGCGC CCTGTACCTC TCGGGCCTGG TCGAGCTCTG GATCGCCATC
GGCCTGACGG CCGGCGCCTG GCTGAACTGG AAGTTCATCG CCCCGCGGCT GCGCACCTAC
ACGGAGGTGG CCGACGACGC CATCACCGTG CCCAGCTTCT TCAGCAACCG CACCGACGAC
CACGCCGGGC TGCTGCGCAT CACCGCCGGT GTCATCATCC TGGTCTTCTT CACCTTCTAC
GTGTCCTCGG GCATGGTCGC CGGAGGGCGC TTCTTCCAGG CCAGCTTCGA CACCAGCTAC
ACCACCGGCA TGCTGGTCGT CTCCGGCATC GTCATCCTCT ACACCCTGAT CGGCGGCTTC
CTCGCGGTGT CCTACACCGA CGTGGTGCAG GGCCTGATGA TGCTGGCAGC CCTGATCCTC
GTGCCGATCG CCGGTGTGGT GCACCTGGGG GGTCTCGGTG ACACGGTGGA CGCGATCCGC
TCGGTCGACG CGCACGCCCT GAGCCTGTTC GGCGGGGGCC TGACCACGAT GGCCTTCATC
TCGGCCGTGG CGTGGGGCCT GGGCTACCCG GGCCAGCCGC ACATCATCAC CCGCTTCATG
GCGCTGCGGT CGCCGCGGCA GGCCCGGTCG GCACGCCGCA TCGGCGTGGG CTGGATGGCG
CTGGCCTGCC TGGGCGCCGC GGCCACCGCG CTGGTGGGCA TCGGCGTCTT CCAGCGCGAG
AGCGAGCAGC TCACGGACCC GGAGACCGTC TTCATCGACC TGGGTGTCCT GCTGTTCCAC
CCGTTCGTCG CCGGCCTGAT GCTGGCCGCC ATCCTCGCGG CGATCATGTC CACCATCTCC
AGCCAGCTCA TCGTCTCAAG CTCGGCGCTG GTGGAGGACA TCTACCTCGG GATCACGGGC
AAGGAGCTGC GCGGCAACAT CGGCGCCCAC CTGGGCCGGG TCGCGGTGCT GGTGATCGCG
CTGGTGGCCG GTGCCCTGTC GCTCAACCCG AGCGACACGA TCCTGGACCT CGTGGCCTTC
GCGTGGGCAG GCTTCGGTGC CTCCTTCGGG CCGATCGTGA TCCTCGCCCT GTACTGGCGG
CGCCTCACCA CCCTGGGCGC GCTAGCCGGC ATGGTCACCG GTGCGGTGGT CTCGTTCGGC
TGGGGCCAGC TCGAGGGTGG GCTGTTCGAC CTCTACGAGA TCGTGCCCGG CTTCGCGCTG
AACCTCCTGG TGACCGTCGT GGTCTCGCTG CTGACCCGGC AGCCCGGCCC GGAGGTGCGG
GCCGAGTTCG ACGAGGCCGT CCGGCGGGCC GAGGCGCAGG AGGAGACCGT TCCGGCCTGA
 
Protein sequence
MSDPTWQLIA IVLYFAAMVG IGVWSFTRTG DKGDYMLGGR GLGPVASALS AGASDMSGWL 
LMGLPGALYL SGLVELWIAI GLTAGAWLNW KFIAPRLRTY TEVADDAITV PSFFSNRTDD
HAGLLRITAG VIILVFFTFY VSSGMVAGGR FFQASFDTSY TTGMLVVSGI VILYTLIGGF
LAVSYTDVVQ GLMMLAALIL VPIAGVVHLG GLGDTVDAIR SVDAHALSLF GGGLTTMAFI
SAVAWGLGYP GQPHIITRFM ALRSPRQARS ARRIGVGWMA LACLGAAATA LVGIGVFQRE
SEQLTDPETV FIDLGVLLFH PFVAGLMLAA ILAAIMSTIS SQLIVSSSAL VEDIYLGITG
KELRGNIGAH LGRVAVLVIA LVAGALSLNP SDTILDLVAF AWAGFGASFG PIVILALYWR
RLTTLGALAG MVTGAVVSFG WGQLEGGLFD LYEIVPGFAL NLLVTVVVSL LTRQPGPEVR
AEFDEAVRRA EAQEETVPA
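
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below is a minimal way to sanity-check a sequence block in the spaced, 10-base layout used on this page: it strips whitespace, computes the GC fraction, and tests the first and last codons. The helper names are hypothetical, and only the first and last rows of the gene sequence are used as sample input.

```python
# Initiation and stop codons of NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_valid_start(seq: str) -> bool:
    """True if the first codon is a table 11 initiation codon."""
    return seq[:3] in TABLE11_STARTS


def has_stop(seq: str) -> bool:
    """True if the last codon is a stop codon."""
    return seq[-3:] in TABLE11_STOPS


# First and last rows of the gene sequence shown above.
first_row = clean("GTGAGCGACC CCACCTGGCA ACTGATCGCC ATCGTCCTGT ACTTCGCCGC CATGGTCGGC")
last_row = clean("GCCGAGTTCG ACGAGGCCGT CCGGCGGGCC GAGGCGCAGG AGGAGACCGT TCCGGCCTGA")

print(has_valid_start(first_row), has_stop(last_row))
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

To check the GC content reported for the whole gene, pass the full 1500-base sequence through `clean` and `gc_fraction` instead of a single row.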