Gene Mlg_0698 details


Gene Information

Locus tag: Mlg_0698
Symbol:
ID: 4268857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 777079
End bp: 778545
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 64%
IMG OID: 638125447
Product: flagellin domain-containing protein
Protein accession: YP_741542
Protein GI: 114319859
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
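
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(777079, 778545)
print(length_bp)                  # 1467
print(protein_length(length_bp))  # 488
```

Both values match the Gene Length and Protein Length fields of this record.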


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.911513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.224965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGG TCATCAACAC CAACATCGCT TCCCTGAACG CCCAGCGCAA TCTCAACGCC 
AGTCAGGGCC AGTTGGGCGT GGCCCTGGAG CGGCTGAGCT CGGGACTGCG GATCAACAGC
GCCAAGGACG ACGCGGCGGG GCTGGCCATC AGCAGCCGCT TTACATCGCA GATCAACGGC
CTGGACCAGG CCGTGCGGAA TGCCAACGAC GGCATCTCCT TTGCCCAGAC CGCGGAGGGG
GCACTGGACG AGAGCACCAA CCTGCTGCAG CGGATCCGCG AGCTGGCGGT GCAGGCGTCC
AATGACACCA ACTCCGCCTC CGATCGCGAT GCCCTGGACC AGGAGGTGCA GCAGGCCATC
CGCGAGATCA GCCGCATTGC CGCCTCCACC CAGTTCAACA ACCAGAACAT CCTGGACGGC
TCGCTCAGCG ACCTGATCTT CCAGGTGGGC GCCAACCGGG GGCAGATCAT CTCGGTGGAC
GGGGTGGATG CCCGGGCCCA GTCGCTGGGG GCGCAGGTGG CAGACTCCGG TGCGGTGGCC
ACCAACACCT TGTCCGAGGG TGGAGAGCTC TCCATTGCCG GGGTGACCAT CGATATGGAT
GGTGCCGAGA ACATCAGCGA TGTGATCAGC CGAATCAACG ACAACTTCTC CGAGACGGGG
GTGCAGGCCT TCCAGACCTC CGGTGGGTCC ATCGTTGCGG AAACGGGGCT AAGTGAGGAT
GAGATCTCGT TTACCGCGGA TCCCGACCCG GTGCAGCTGA TGACCATCAA TGGGGTCAAT
GTCTTCTCGG AGGCCGGGGC GGGCATCGAC AATCCGCAGG CGTTGGCGGA GCGGATCAAT
GCCTATACGC CGTTGACCGG CGTGTCGGCG GTGGACGTGG ACGGGGAACT CGTCCTTGAG
AGCCAGGCGG ATCAGGAGTC GATCCGGGTT ACGGATGTCA ATGCCGACCT GTTTGAGGCA
ACCACCACGG ATTTCGGTGA GGAAACCGTC TTTTTCGATG AGAATGACGA CCTCCGGACC
GAGTTGACCT TCGAGCGTGG CTTCGACCTA CGGGTGCCCC TGGAGGCCGA CCCTCCGGTG
ATTGCCGATG ACAGCGATGG GGATCTCCTT CAGCGCCTGG GTCTGGTCGG TGGCGCCGAT
CGCTTCGAGA CCTTCAGCGC GGACACCGTC AGTGTCGCCA CCCGCGGCGA GGCGCAGGAC
GCCATCCGCA CGGTGGACAT CGCCCTGCAG GAGATCAACG GCATCCGCGC CGACCTGGGT
GCGGTACAGA ACCGCTTCGA GGCCACCACC GCCAACCTGA CCATCACCTC GGAGAACCTC
AGCGCCTCGC GCAGCCGCAT CATGGATGCC GACTTCGCCG CCGAGACCGC CGCACTGACC
CGCGGCCAGA TCCTGCAGCA GGCCGGTACC TCGGTGCTGG CGCAGGCCAA CCAGCTACCC
AACAACGTCC TCAACCTGCT GCAGTAA
 
Protein sequence
MAQVINTNIA SLNAQRNLNA SQGQLGVALE RLSSGLRINS AKDDAAGLAI SSRFTSQING 
LDQAVRNAND GISFAQTAEG ALDESTNLLQ RIRELAVQAS NDTNSASDRD ALDQEVQQAI
REISRIAAST QFNNQNILDG SLSDLIFQVG ANRGQIISVD GVDARAQSLG AQVADSGAVA
TNTLSEGGEL SIAGVTIDMD GAENISDVIS RINDNFSETG VQAFQTSGGS IVAETGLSED
EISFTADPDP VQLMTINGVN VFSEAGAGID NPQALAERIN AYTPLTGVSA VDVDGELVLE
SQADQESIRV TDVNADLFEA TTTDFGEETV FFDENDDLRT ELTFERGFDL RVPLEADPPV
IADDSDGDLL QRLGLVGGAD RFETFSADTV SVATRGEAQD AIRTVDIALQ EINGIRADLG
AVQNRFEATT ANLTITSENL SASRSRIMDA DFAAETAALT RGQILQQAGT SVLAQANQLP
NNVLNLLQ
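
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative and do not come from the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three 10-base blocks of the gene sequence above.
print(translate("ATGGCACAGG TCATCAACAC CAACATCGCT"))  # MAQVINTNIA
print(translate("AACAACGTCC TCAACCTGCT GCAGTAA"))     # NNVLNLLQ
```

The two outputs correspond to the first and last blocks of the protein sequence listed above.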