Gene Mlg_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2831 
Symbol 
ID4270875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3214889 
End bp3215977 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID638127593 
ProductOmpA/MotB domain-containing protein 
Protein accessionYP_743661 
Protein GI114321978 
COG category[N] Cell motility 
COG ID[COG1360] Flagellar motor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.227134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGAGG ACACCCGCCG CCGACGCCGG GGGCTGAACA TCTGGCCGGG CTACGTGGAC 
GCCCTGGCCA CCCTGCTGCT GCTGTTCGTC TTCGTGCTGT CGCTGTTCAT GGTGGCCCAG
TACGTCCTCA GCGACGCCCT GTCCGGGCGC GAGGCGGCGT TGGCACGGCT GCAGGCCGAT
ATCGACGCCC TCACCGAGAT CATCGCCCTG GAACGGGAGG AGCGTGCCGG GGTGGAGGAG
GAGCTGGCCG AGCTGGAGGC CCGGCTGGTC GCCACCCTCG CCGAGCGCGA TCAGGCACGC
GCCCGGGTCA GCACCCTGGA GGCACAGCAG GCCGCCCTCG AGGACAGTCT GGCCGATCAG
GAGGAGGCCC TGGACGAGGC GGCGGCGCGC CGCGCGGAGC TTCGGGACCG GCTGGCCGGC
CGCGAGGCGG ACCTGGCCCG CGAGCGCGCC CTCACCGACG AACAGGCCGC CCGCATCGAC
CGGCTGCACC GGCAGATCAT CGCCCTGAGA GAGCAGCTCA CCGCCCTGTC CGAGGCACTC
GACCTCAGCG AGGCCACCGC CGCGGCCCAG CGCGCCGAGA TCCGCGATCT CGGCCAGCGC
CTCAATCTGG CGCTGGCGGA GCGGGTGCAG GAACTGGCCC GCTATCGCTC GGAGTTCTTT
GGTCGGCTGC GCGAGGTACT GGGTGATCAC CCGGACATCC GGATTGAGGG CGACCGCTTT
CTGTTCCAGT CCGAGCTGCT GTTCGCTACC GCCTCGGCGG ATCTCGGTGG CGAGGGCCGG
GAGCAGCTCG AGGGGCTGGC CACCACCCTG CACGAATTGC GCGGGCGCAT CCCGGACGAC
CTGGACTGGG TCCTGCAAGT GGAGGGCCAC ACCGATCGCC GCCCCATCCG CACCGCCGAG
TTCCCGTCCA ACTGGGAGCT CTCCACCGCC CGCGCCCAGA CCATCGTGCG CTACCTGATG
GACCAGGGCA TCCCGCCGGA ACGGCTGGCC GCCGCCGGCT TCGCCGAATA CCATCCGGTG
GACGACCGCG ACACCCCGGA GGCCTGGGCC CGCAACCGGC GCATCGAACT GCGATTGACC
AACCGCTAG
 
Protein sequence
MLEDTRRRRR GLNIWPGYVD ALATLLLLFV FVLSLFMVAQ YVLSDALSGR EAALARLQAD 
IDALTEIIAL EREERAGVEE ELAELEARLV ATLAERDQAR ARVSTLEAQQ AALEDSLADQ
EEALDEAAAR RAELRDRLAG READLARERA LTDEQAARID RLHRQIIALR EQLTALSEAL
DLSEATAAAQ RAEIRDLGQR LNLALAERVQ ELARYRSEFF GRLREVLGDH PDIRIEGDRF
LFQSELLFAT ASADLGGEGR EQLEGLATTL HELRGRIPDD LDWVLQVEGH TDRRPIRTAE
FPSNWELSTA RAQTIVRYLM DQGIPPERLA AAGFAEYHPV DDRDTPEAWA RNRRIELRLT
NR