Gene Ksed_04020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_04020 
Symbol 
ID8371912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp388272 
End bp390101 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content66% 
IMG OID644990698 
Productcholine/carnitine/betaine transport 
Protein accessionYP_003148242 
Protein GI256824282 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.0187444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0477577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACAC ACACCGACCA CGATCGAGGC ACCCACATGT TGCAGAAACT GCACGACGCG 
ATGGGTCTGC GGACCTCACC GACGATCTTC TTCGGCTCGT TCGCCGTCGC GATCGTCTTC
ATCGTCATCA CGTTGTCCTT CCCCGGGACG ATGGCCAACA CCTTCGAGTC CGGGTCGGAC
TGGCTGCTGG GCAACCTCGG CTGGTTCTAC ATCCTCGGCG TGACCGTCTT CCTGATCTAC
CTGATCGTCA CGGCCATGAG CCGTTTCGGA CGCATCCGGC TCTCCCCCCA CGACGAGGCG
CCCGACCACT CCTTCCCGGC GTGGTTCGCC ATGCTGTTCG CGGCGGGTAT CGGCACCATC
CTGATGTTCT GGGGTGTGGC CGAGCCGGTG AACCACTTCG CCAACCCGCC GATGGCGGAC
GTGGAGCCCG AGAGCGTGGA CGCCGCCCGC GAGGCCATGA GCTTCACGCT CTACCACTTC
GGGCTGCACA CCTGGACGAT CTTCGCCCTG CCGTCGCTGG CCTTCGCGTA CTTCATCTAC
CAGCGCAACC TGCCCCCGCG CGTGTCCTCC CTCTTCCACC CGCTGCTGGG TGACAAGGGC
ATCCACGGCC CCATCGGCAA GACCATCGAC ATCGTCGCGA TCGTCGGAAC GCTGTTCGGT
GTGGCCGTCT CCATCGCGCT GGGCACGCTG CAGATCAACG CCGGCCTGAG CGCCGTGCTG
GGCATCGACC AGTCCACGCT CAGCATCCTG CTCATCGTCG GCGTGGTGAC GGTCCTGGCC
CTGTGCTCGG TGATCGCCGG CCTGGACAAG GGCATCAAGG TGCTGTCGAA CGCCAACATC
CTGGCCGCCG TGGGCCTGAT GGTGTTCGTC CTGATCTCGG GCCCGACGCT GCACCTGCTG
CGCGGCACCA TCGAGGGCGT GGGCCTGTAC GCCCAGAACC TGCCCTACCT GGCCTTCTGG
AACGACTCGT TCGACGACAA CCCCGGCTGG CAGGACGGCT GGACCATCTT CTACTGGGCC
TGGACCATCA CCTGGTCGCC CTTCGTCGGC ATCTTCATCG CCCGCATCTC CCGCGGTCGC
ACCATCCGCC AGTTCATCGT CGGCGTGCTC GCCGCCCCGG TGAGCTTCTC CATCATCTGG
TACGGCATCT TCGGCTTCGC CAGCTTCGAC ATCATCCGCA ACCAGGAGGG TGGCGGCCAG
CTGGTCGACT CCGTGCTCAA CGACGGTGCC GAGGTGGCGC TGTTCGAGTT CCTGGAGCAC
TTCCCGTTCA CCACGTTCAT GTCGGTCTTC TCGATCGCCA TCGTGGCCAT CTTCTTCGTG
ACCTCGATGG ACTCGGCCTC GCTGGTGATG GACTCCATGG CCCGCGGTCA CGACGAGGAC
GAGCGGGTGC CGGTCCTCCA GCGGATCATC TGGGCCATCA CCGTCGGTGC CATCGCGGCG
GTGCTGCTGA CCTTCTCGCC GGATGCCGGC ATCTCGGCCC TGGAGGACGT CATCACGATC
GTGGGCCTGC CCTTCTTCGT GATGGGGTAC CTCATGATCT GGGCGCTGAA CCGGGCCATG
AAGGAGGACG CCGGTGAGCT CCTCCCGCTG GCCACGCGCC GGTACCGCAA GGTGCTGCCG
CCCGAGGAGG TCGAGCGTCG CCGCGCCGAG GGCGACGAGG CGTGGTCCGA CACGGCCCTC
GAGCAGGACC CGCACTACAT GGACAGCGAG GGACACGTCA TCGCGGCCCC CGAGACCGCG
GCCTACGAGC AGCACGAGCT GTACGAGGCC GGGGAGGACC GTCCCTCCGA GGAGGTGACC
GGCTCCAGCC GGTTGAACGG CACCGCCTGA
 
Protein sequence
MTTHTDHDRG THMLQKLHDA MGLRTSPTIF FGSFAVAIVF IVITLSFPGT MANTFESGSD 
WLLGNLGWFY ILGVTVFLIY LIVTAMSRFG RIRLSPHDEA PDHSFPAWFA MLFAAGIGTI
LMFWGVAEPV NHFANPPMAD VEPESVDAAR EAMSFTLYHF GLHTWTIFAL PSLAFAYFIY
QRNLPPRVSS LFHPLLGDKG IHGPIGKTID IVAIVGTLFG VAVSIALGTL QINAGLSAVL
GIDQSTLSIL LIVGVVTVLA LCSVIAGLDK GIKVLSNANI LAAVGLMVFV LISGPTLHLL
RGTIEGVGLY AQNLPYLAFW NDSFDDNPGW QDGWTIFYWA WTITWSPFVG IFIARISRGR
TIRQFIVGVL AAPVSFSIIW YGIFGFASFD IIRNQEGGGQ LVDSVLNDGA EVALFEFLEH
FPFTTFMSVF SIAIVAIFFV TSMDSASLVM DSMARGHDED ERVPVLQRII WAITVGAIAA
VLLTFSPDAG ISALEDVITI VGLPFFVMGY LMIWALNRAM KEDAGELLPL ATRRYRKVLP
PEEVERRRAE GDEAWSDTAL EQDPHYMDSE GHVIAAPETA AYEQHELYEA GEDRPSEEVT
GSSRLNGTA