Gene Mlg_1153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1153 
Symbol 
ID4270659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1350000 
End bp1351424 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content67% 
IMG OID638125902 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_741992 
Protein GI114320309 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCGG TAGTCAAGGA CAACGAAGAG CGCACGCCTT CCGTGACCAT GAACCATTTC 
TCGGTCATCC AGCAGATCGC CGAGCGCACC AGCGGCGTGG GTGTGGAGGC GGTGGACATT
GTCAACCACA TCGAACACGT CGCCCAACTG TTCCGGCGCC AGGCGGCCAT ATTCTCCGAC
CTGGTGGGCA CCGCGCGGCG CATGAGCGAG GCCAACGACA CCGTCGACCA AGCCGCCCGC
CGCGCCCACG ACGTGGCGGC CAGCGCCGGC AAGGACGTGG AGCGCTCACG CGCAACCATC
ACCCACTCGC TGGAGCAGAT CCGCGGGCTG GCCGAGTCGG TCACCCATAT CGAAGGCCAG
TTGAGCAGCC TCAACGAGGC GCTGCAGGGC GTGGCCCAGG TGGCCAATGA GATCAGCACC
ATTGCCAAGC AAACCAATCT GCTCGCCCTC AACGCCACCA TCGAAGCCAC CCGCGCCGGC
GAGGTGGGCC GCGGTTTCGC CGTGGTGGCG GAGCACGTCA AGGAGCTGGC CAAGCAAACG
GCGGATGCCA CCTCCGAGAT CCATCAGATC CTGGAGGAGT TGACCAACAT CATCGAGCGG
CTGATCCGGC GCGGTGCCGA GAGCACGGAG AAGGCCCAGG CGGTGCGTGA GGGCACCCAC
GCCATCCAGG AGATCATGGA GACGGTGGGC AGCGCCATGC AGGACGTGGA CACCGAGTCC
GGGCGCATCC ATGAGTCCGT CGGCGAGATC GACCAGCACT GCCGCAGGAC GGTGGGAGGG
CTCGAGGAGC TCACCAGCGA GGTCCGGCAC GCCAACGATG CATTGGAGGA GGCTGAGGCA
CGGACGGAGA AGCTGCGTCA TTACACCGAG CTACTGATCC GCGAAACGGC CGTGGAGGGC
GTGGAGACCA TCGACACGCC CTATATCCGC TTGGCCGAGC AACTGGCCGC GGACACCGCC
GCCGCCTTCG AGGAGGGCCT GAAGCGCGGC AGTGTCAACG AACAGGCGCT GTTCGATCAG
AACTACCAGC GCGAGGCCGA CACCGCGCCA CCCCGTTTCA CCGCCGCCAA CTCCGAGTTC
TGTGACCGGG TGCTGCCGGC CATCCAGGAA CCGGCCCTGC AGCGCCAGCG CCGGGTGCTG
TCCGCCTTCG TCTGCGACCT GGGCGGGTAT GTGCCGGCAC AAAGCCGTGA GGCCGCCCGC
GCCCCGTCAG CGGACGACAG CGCCGAACTA CCGCGGTTTC ATCGGCGACG GCTGGACAGC
CGCGAGACCC GTGCCGCGAC CCGCAACCGC GAGCGTTTCC TGCTGCAGAC CTACCGCCTG
AATCTGGGCG AAGGGCACCA TGCCCTGGTC AAAGAGGTCT CGGTGCCCAT CAAGGTGGGC
GGGCGGCACT GGGGGTGTCA GCGGCTGATC TACGCGGCGG ACTGA
 
Protein sequence
MASVVKDNEE RTPSVTMNHF SVIQQIAERT SGVGVEAVDI VNHIEHVAQL FRRQAAIFSD 
LVGTARRMSE ANDTVDQAAR RAHDVAASAG KDVERSRATI THSLEQIRGL AESVTHIEGQ
LSSLNEALQG VAQVANEIST IAKQTNLLAL NATIEATRAG EVGRGFAVVA EHVKELAKQT
ADATSEIHQI LEELTNIIER LIRRGAESTE KAQAVREGTH AIQEIMETVG SAMQDVDTES
GRIHESVGEI DQHCRRTVGG LEELTSEVRH ANDALEEAEA RTEKLRHYTE LLIRETAVEG
VETIDTPYIR LAEQLAADTA AAFEEGLKRG SVNEQALFDQ NYQREADTAP PRFTAANSEF
CDRVLPAIQE PALQRQRRVL SAFVCDLGGY VPAQSREAAR APSADDSAEL PRFHRRRLDS
RETRAATRNR ERFLLQTYRL NLGEGHHALV KEVSVPIKVG GRHWGCQRLI YAAD