Gene Mlg_0703 details

Gene Information

Locus tag: Mlg_0703
Symbol:
ID: 4268092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 784745
End bp: 786178
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 68%
IMG OID: 638125452
Product: flagellar hook-associated 2 domain-containing protein
Protein accession: YP_741547
Protein GI: 114319864
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
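
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(784745, 786178)
print(length_bp)                  # 1434
print(protein_length(length_bp))  # 477
```

Both results agree with the Gene Length and Protein Length fields of this record.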


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0307046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.26193
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTA TCAGTTCCCT GGGTGTGGGG TCGGGTCTGG ATATCCGCGA TCTGGTCGAC 
CAGCTGGTGG CGGCCGAGCG CCAGCCCGGT CAGAACCGGC TGGACCGCCA GGAGTCGCGG
CTGGAGGCCC AGATCTCCGG GCTGGGCCGG CTGCAGCAGG CCATCACCGA CTTCGGCGGT
GCCCTAGGCA ACGTCTCCGC TGAGAGCGAC TTCCGCAGCG TGGTGGCCAC CAGCAACAAC
GAGTCGGCGG TCAGCGTCTC CGCCGGCCGC GACGCCCCAC CCGGCAGCTA CGACGTCAAC
GTCACTCAAC TGGCCCAGGC CCAGCGGCTC GCCACCAATT CGGACCTGTT CGAGGACGTG
GAGGACTTCT CGGCGGGCAC CACCTCGCTG GGCACCGGCA GCTTCACTAT CGAGTACCAG
GACGGCAGCG CCGAGACCTT TCAACTGGAG GAAGGGGCTG ACACCCTGCA GGACGTGCGC
GCCGCCATCA ACAACCAGAG CGAGAACGTG CGTGCCTCGG TGGTGGACGA CGGCGAGGGC
CCGCGGCTGG TGATCACCAG CCGGGAGACC GGCGACCAGA ACGCCGTCTC GGCGATCACC
GTCGATCCCG ACGACCCGGA CAGCGACCCG CTGCTGGAGC GGCTGGCATT CGATGCTGAG
AACCTGGCCG AGCCGGACGA GGATGGTGTG CGCGCCGGCG ACAACTTTTC CCAGTTGCGC
GCTGCGCAGG ACGCCGAACT GTTCGTGGAC GGACTGCGAA TCACCCGGCC CGGCAATGAG
ATCAGCGGCG TGATCGATGG CGTCACCCTC ACGCTCAGCG CGGTGGACAG CGCCCGGATC
AACGTGGCGG AGGAGCCCGG GTCGGCCGCC TCCGCGGTGA GCGACTTTGT CGGGGCCTAC
AACAGCCTGC AGCGGACCCT GGGTGAGTTG AACGCCTTCG ACCCGGAGTC CGGCGAGGCG
GGTGAGCTCA AGGGCGACTC CACCCTCCGC TCGGTGCAGG CCCGCCTGCG CCAAATGATC
AGCGAGCCGC TGCCGGGGGC GGGCGGGCCG GTGCAGACCC TCGCGGACCT GGGCATCACC
ACCCGCCGGG ACGGTACCCT GGAGATCAAC GACGCCCGGC TGGATGATGC GCTGAGCGAG
AACCGGCTGG ATGTCATCCG TCTGTTCACG GACGAGGAGA ACGGGCTGGC CGCGCGCCTG
CAAGGCGCCG TGGATGAGTT CACCGGCCGG GACAGCGTGA TCAACAGCCG CACCGAGTCA
CTGCAGGACC GGCTCGCCGC CCTGGCGCCA CAGCAGGAGC GCCTGGACCG GCGGATGGAT
CAGCTGGAGG CGAGGCTGAT CCGCCAGTTT TCCGCCATGG ACTCGATGAT CGCGCAGATG
AACCAGACCA GCGAGTTCCT GGATAACCAG CTTAACCTGT TGAATCAGCA ATAG
 
Protein sequence
MASISSLGVG SGLDIRDLVD QLVAAERQPG QNRLDRQESR LEAQISGLGR LQQAITDFGG 
ALGNVSAESD FRSVVATSNN ESAVSVSAGR DAPPGSYDVN VTQLAQAQRL ATNSDLFEDV
EDFSAGTTSL GTGSFTIEYQ DGSAETFQLE EGADTLQDVR AAINNQSENV RASVVDDGEG
PRLVITSRET GDQNAVSAIT VDPDDPDSDP LLERLAFDAE NLAEPDEDGV RAGDNFSQLR
AAQDAELFVD GLRITRPGNE ISGVIDGVTL TLSAVDSARI NVAEEPGSAA SAVSDFVGAY
NSLQRTLGEL NAFDPESGEA GELKGDSTLR SVQARLRQMI SEPLPGAGGP VQTLADLGIT
TRRDGTLEIN DARLDDALSE NRLDVIRLFT DEENGLAARL QGAVDEFTGR DSVINSRTES
LQDRLAALAP QQERLDRRMD QLEARLIRQF SAMDSMIAQM NQTSEFLDNQ LNLLNQQ
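
The GC content and the protein sequence can both be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library, not the pipeline that produced this record. It assumes the sequence is pasted with the 10-base spacing shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `gc_content` and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAGTA TCAGTTCCCT GGGTGTGGGG"))  # MASISSLGVG
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared with the protein sequence and the GC content field listed in this record.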