Gene Mlg_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1232 
Symbol 
ID4269016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1436029 
End bp1438449 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content70% 
IMG OID638125982 
Producthypothetical protein 
Protein accessionYP_742071 
Protein GI114320388 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.755816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.21821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTA AACGGGTCTT CATCTCCGCC GTTGCCGGGT TGACGTTTTC CTCCACCGTC 
CTGGCCCTGG GCCTGGGCAC CATCGAGCGT GACTCCTGGC TCAATCAGCC GCTCTCCGCG
CGGATTCCGC TGCACTCGGC GGACACCGTT AGCCTGGAGG CGTTGCAGGT CACCATGGCC
TCGCAGGAGA CCTTCGAGCG CGCCGGGCTC GACCGGCCTC CCTACCTGCG CGATATCCGC
TTTGAGGTGA TCGAGGACCA CACCGAGGGG CCCCATATCC GGGTCTATAC CCGGGATCCG
TTCCGCGAGC CTTTCGTTGA TTTCCTCGTG GAGTTGAACT GGCCGCAGGG CCGTTTGGTG
CGGGAATACA CCCTGTTGCT GGACCCGCCC CGCGACCCCG CGGAGCGGCC GGTGGCGGCC
ACGCCGCCGC GGACCGAACC GGCGGAGGTC GAGGAACAGC CACTGGAAAC GGCGTCCCCC
ACGGTTCGGG ACGGCCACTA CGGGCCGGTG GCCGGCGGTG AGACCCTGTG GTCGGTGGCC
GACCGGGTGC GCCATCAGGG GGTCAGTGCC CAGCAGATGG CCCTGGCCCT GTTCGAGGCC
AACCCGAGGG CCTTTGTCGC CAACGATATC AACCGGCTAC AGGCCGGCGC CACGCTGACT
GTGCCCGATG CCGAGGGGGC CCGGGCCTAT TCCCGCAACG AGGCCGTGCG CCGCTTCAAT
GCGCTGGCCG AAGGCGTGGC GGAGGAGCCG GCGGAGGAGC CGCCCGTAGC GGAGGCGCCG
GATCCGGAGA CGGTGGACCG GCAATTGCAG ATCCTCGCCG AGTCGGAACG CGACGAAGAG
GCCCTGGCGA GCCTTCTCGA TGGTGATGTG GAACCCAGCG AGGAGAATCT GGGCGCGCTG
CGCGAGGAGC TGCTGCGTGC CCGCGAGGAC CAGGCCAGCC TGCGTTCGGA GAACGAAGCC
CTGCGCGAGC GGGTGTCGGA GATGGCGCAG GAGATCGAGC GACTGGAGCG GCTGCTCACC
CTGGACGTGG AGACCGGGGT GCTGCCGATG GTGCCGAGCG AGCAGCCTGA AGGGGCCGTG
GCCGAGCCTC CACCGGAGCG CCCGGCGGTG GTGGATGAGC CCGAAGTCGA GCCGGAGATC
ACTGAGGAAA CGCCGGCGGA GGACGATGAG GCGCCGGTGA CGGCGCCCCC GCCGGCCCCC
TCTTCACCGC AACCGCTCGC CTGGCTCGAC GGGCTCAGCC TGCCTGTGGC GCTGGCGGGC
GTCGGCATCC TGATCGGGCT CCTGGCCCTG CTGCTCATAC GGCGCCGGCG GACGGAGACC
GGCGTGGCCG CGGAGGAGGT GCCCATTCAA CAGCCGCCGG CGCGAGAAGG CGTGGCGGCC
GGGACGGCGG CCGCGGCGAG CGCGGTGGCC GACGGGGACG AACCGGAGCC TGAACCTGCG
CCTGAGCCCC GCTCCGAGCC CCAGGATCCC CTGCAGGTGG CCGATGAGTT GGCAGACCGG
GGCGATCTGG AGGGTGCCCG CGCCACCCTG CTGACGGCGC TCAACAATGT GCCGGCCCGC
AACGAGTACC GGGTGCGCCT ATTGGAGATC CTGGCTGAAG CGGATGACCG CGAAGGCTTT
GATCGGCAGG CGCAGGTATT GCGTGAACGG GTCGAGGGCG AGGACGATCC CCTCTGGCGC
CAGGCGGAGC GCGTTGGCCG GGGCTATGCC CCTGACAATC CGCTGTTTGG TGGCGCTGAC
CCGGATGGCG CCGATGATGG AGGGGCGGAA GCCACCGGCG GTGCGCTGCC CACGCTCGAC
GAATCAGCGC CGGGGGGCGC CGATTTGGCG GAACCAGCGG AGGAACCTGC CCGGGAGGGT
GAGGCCTTCG ATTTCACTCT CGACTTCCCG GAGCCCGATG AGCCGGGCCG GGAGACGGAT
CAGGTAGCCC GACAGGGGGC GTCACGGGAA ACGGCGGCGC CGGAGACCCC TGCGACACCC
GCCGGTGAGG AAGGCGCTGC CGGTGGGGAA GGGGGTGATT TCACCCTCGA CTTCGAAGTG
GACGATAGCT GGCGGGAAGC GGCCGACTCG CGCCCGGCTG CGGATGCTGG CGAATCCCCG
GGGGACGAGG CCTTCGACCT GGACCTGGGC GACCTGACCT TGGACGAGAG CGGGGCCGGG
GATGCGACCG GGGAGGAACG TGGTGGCGGC GAACCGTCAC CAGGGGAGGC ACCGGACGCC
GATCAGCCGC CGGCACCTCC CGCGGAGGCG CCGATGGAGG ACGCCGGTGG GGACGAGGAG
ATCGCCACCA AGCTGGACCT CGCCCGGGCC TACGTGGATC TGGGTGATCC GGACGGTGCC
CGAGAGTTGC TCAACGAGGT CCTGGAGGTC GGCACCCCGG CCCAGCAGGA GGAGGCGCGG
AAGCTCCTGG GCGACCTCTG A
 
Protein sequence
MSFKRVFISA VAGLTFSSTV LALGLGTIER DSWLNQPLSA RIPLHSADTV SLEALQVTMA 
SQETFERAGL DRPPYLRDIR FEVIEDHTEG PHIRVYTRDP FREPFVDFLV ELNWPQGRLV
REYTLLLDPP RDPAERPVAA TPPRTEPAEV EEQPLETASP TVRDGHYGPV AGGETLWSVA
DRVRHQGVSA QQMALALFEA NPRAFVANDI NRLQAGATLT VPDAEGARAY SRNEAVRRFN
ALAEGVAEEP AEEPPVAEAP DPETVDRQLQ ILAESERDEE ALASLLDGDV EPSEENLGAL
REELLRARED QASLRSENEA LRERVSEMAQ EIERLERLLT LDVETGVLPM VPSEQPEGAV
AEPPPERPAV VDEPEVEPEI TEETPAEDDE APVTAPPPAP SSPQPLAWLD GLSLPVALAG
VGILIGLLAL LLIRRRRTET GVAAEEVPIQ QPPAREGVAA GTAAAASAVA DGDEPEPEPA
PEPRSEPQDP LQVADELADR GDLEGARATL LTALNNVPAR NEYRVRLLEI LAEADDREGF
DRQAQVLRER VEGEDDPLWR QAERVGRGYA PDNPLFGGAD PDGADDGGAE ATGGALPTLD
ESAPGGADLA EPAEEPAREG EAFDFTLDFP EPDEPGRETD QVARQGASRE TAAPETPATP
AGEEGAAGGE GGDFTLDFEV DDSWREAADS RPAADAGESP GDEAFDLDLG DLTLDESGAG
DATGEERGGG EPSPGEAPDA DQPPAPPAEA PMEDAGGDEE IATKLDLARA YVDLGDPDGA
RELLNEVLEV GTPAQQEEAR KLLGDL