Gene Mkms_4573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4573 
Symbol 
ID4612517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4803833 
End bp4805236 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content68% 
IMG OID639794260 
Productethanolamine transproter 
Protein accessionYP_940554 
Protein GI119870602 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID[TIGR00908] ethanolamine permease 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0966577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACCA CGCACAGCGA ATCCAGCGAC TACCTGGCCA AACGGACACT CAAGCAGGGC 
ACTGCCGGCT GGCTCCTCCT TGCCGGTCTC GGCGTCGGAT ACGTGATCTC CGGCGACTAC
TCGGGATGGA ACTTCGGGCT CGCCGAGGGC GGCTTCGGTG GGCTGCTGAT CGCGGGCGTG
ATCATCGCCG GCATGTACCT GGCGATGGTG TTGGGGATGG CCGAGATGTC CTCGGCGCTG
CCTGCCGCGG GCGGCGGCTA CACCTTCGCG CGCCGCGCGC TCGGACCGTG GGGCGGATTC
GCCACGGGTA CAGCCATTCT CATCGAGTAC GCGATCGCCC CGGCCGCCAT CGCCACATTC
ATCGGCGCCT ACGTCGAATC GCTCGGGTTG TTCGGTATCA CCGACGGCTG GTGGGTGTAC
CTGGCGGTGT ACCTGCTGTT CATCGGGATC CATCTCAGCG GCGTCGGTGA GGCGCTCAAG
GTCATGTTCG TCATCACGGC GATCGCGCTG GCCGGCCTGA TCGTCTTCGC CATCGGGGCG
GTCGGCCGCT TCGACGCCGC GAACCTCACC GACATCGCGC CCACCGACGC GGCGGGCGCG
TCGTCGTTTC TGCCGTTCGG CTACCTGGGG ATCTGGGCGG CCGTACCGTT CGCGATCTGG
TTCTTCCTCG CCATCGAAGG CGTGCCGCTG GCCGCCGAGG AGGCCAAGGA TCCGTCCCGC
AACGTCCCGC GGGGAATCCT CGCCGCGATG GGCGTGCTGT TGGTGACCGG CAGCACCGTG
CTGGTGCTGG CGGCCGGCTC CGGCGGGGCG GAGCTGATCA GCGCCTCGGG CAACCCGCTT
GTGGAGGCGC TCGGCGACAC GACGACGTCG AAGGTGGTCA ACTACATCGG CCTGGCCGGG
TTGGTCGCGA GCTTCTTCTC GATCATCTAC GCCTACTCGC GTCAGCTGTT CGCGCTGTCC
CGCGCGGGGT ATCTGCCCAG GCGGCTGTCG GTGGTCAACT CGCGCAAGGC GCCGACGCTG
GCCCTGGTGG TGCCCGGGAT CATCGGCTTC ATCCTGTCGT TGACCGGGCA GGGCGCCATG
CTGCTCAACA TGGCGGTGTT CGGCGCCGCG CTGTCCTACG TGCTGATGAT GGTCAGCCAC
ATCGCGCTGC GGGTGCGCGA ACCGGACATG CCCCGGCCGT ACCGCACGCC GGGCGGCGTC
GTCACCACCG GTTTCGCACT CGTCATCGCC GCCCTCGCGG TGGTCGCGAC CTTCCTCGTG
GACAGCACCG CGGCCACCTG GTGCCTGGTG GTGTTCGCCG CGTTCATGGC CTACTTCGGC
CTCTACAGCC GCCACCACCT GGTGGCCAAC TCGCCCGACG AGGAATTCGC CGCGCTGGCC
GATGCGGAGA AGGACATCCT TTGA
 
Protein sequence
MDTTHSESSD YLAKRTLKQG TAGWLLLAGL GVGYVISGDY SGWNFGLAEG GFGGLLIAGV 
IIAGMYLAMV LGMAEMSSAL PAAGGGYTFA RRALGPWGGF ATGTAILIEY AIAPAAIATF
IGAYVESLGL FGITDGWWVY LAVYLLFIGI HLSGVGEALK VMFVITAIAL AGLIVFAIGA
VGRFDAANLT DIAPTDAAGA SSFLPFGYLG IWAAVPFAIW FFLAIEGVPL AAEEAKDPSR
NVPRGILAAM GVLLVTGSTV LVLAAGSGGA ELISASGNPL VEALGDTTTS KVVNYIGLAG
LVASFFSIIY AYSRQLFALS RAGYLPRRLS VVNSRKAPTL ALVVPGIIGF ILSLTGQGAM
LLNMAVFGAA LSYVLMMVSH IALRVREPDM PRPYRTPGGV VTTGFALVIA ALAVVATFLV
DSTAATWCLV VFAAFMAYFG LYSRHHLVAN SPDEEFAALA DAEKDIL