Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1232 |
Symbol | |
ID | 4269016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1436029 |
End bp | 1438449 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638125982 |
Product | hypothetical protein |
Protein accession | YP_742071 |
Protein GI | 114320388 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.755816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.21821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTA AACGGGTCTT CATCTCCGCC GTTGCCGGGT TGACGTTTTC CTCCACCGTC CTGGCCCTGG GCCTGGGCAC CATCGAGCGT GACTCCTGGC TCAATCAGCC GCTCTCCGCG CGGATTCCGC TGCACTCGGC GGACACCGTT AGCCTGGAGG CGTTGCAGGT CACCATGGCC TCGCAGGAGA CCTTCGAGCG CGCCGGGCTC GACCGGCCTC CCTACCTGCG CGATATCCGC TTTGAGGTGA TCGAGGACCA CACCGAGGGG CCCCATATCC GGGTCTATAC CCGGGATCCG TTCCGCGAGC CTTTCGTTGA TTTCCTCGTG GAGTTGAACT GGCCGCAGGG CCGTTTGGTG CGGGAATACA CCCTGTTGCT GGACCCGCCC CGCGACCCCG CGGAGCGGCC GGTGGCGGCC ACGCCGCCGC GGACCGAACC GGCGGAGGTC GAGGAACAGC CACTGGAAAC GGCGTCCCCC ACGGTTCGGG ACGGCCACTA CGGGCCGGTG GCCGGCGGTG AGACCCTGTG GTCGGTGGCC GACCGGGTGC GCCATCAGGG GGTCAGTGCC CAGCAGATGG CCCTGGCCCT GTTCGAGGCC AACCCGAGGG CCTTTGTCGC CAACGATATC AACCGGCTAC AGGCCGGCGC CACGCTGACT GTGCCCGATG CCGAGGGGGC CCGGGCCTAT TCCCGCAACG AGGCCGTGCG CCGCTTCAAT GCGCTGGCCG AAGGCGTGGC GGAGGAGCCG GCGGAGGAGC CGCCCGTAGC GGAGGCGCCG GATCCGGAGA CGGTGGACCG GCAATTGCAG ATCCTCGCCG AGTCGGAACG CGACGAAGAG GCCCTGGCGA GCCTTCTCGA TGGTGATGTG GAACCCAGCG AGGAGAATCT GGGCGCGCTG CGCGAGGAGC TGCTGCGTGC CCGCGAGGAC CAGGCCAGCC TGCGTTCGGA GAACGAAGCC CTGCGCGAGC GGGTGTCGGA GATGGCGCAG GAGATCGAGC GACTGGAGCG GCTGCTCACC CTGGACGTGG AGACCGGGGT GCTGCCGATG GTGCCGAGCG AGCAGCCTGA AGGGGCCGTG GCCGAGCCTC CACCGGAGCG CCCGGCGGTG GTGGATGAGC CCGAAGTCGA GCCGGAGATC ACTGAGGAAA CGCCGGCGGA GGACGATGAG GCGCCGGTGA CGGCGCCCCC GCCGGCCCCC TCTTCACCGC AACCGCTCGC CTGGCTCGAC GGGCTCAGCC TGCCTGTGGC GCTGGCGGGC GTCGGCATCC TGATCGGGCT CCTGGCCCTG CTGCTCATAC GGCGCCGGCG GACGGAGACC GGCGTGGCCG CGGAGGAGGT GCCCATTCAA CAGCCGCCGG CGCGAGAAGG CGTGGCGGCC GGGACGGCGG CCGCGGCGAG CGCGGTGGCC GACGGGGACG AACCGGAGCC TGAACCTGCG CCTGAGCCCC GCTCCGAGCC CCAGGATCCC CTGCAGGTGG CCGATGAGTT GGCAGACCGG GGCGATCTGG AGGGTGCCCG CGCCACCCTG CTGACGGCGC TCAACAATGT GCCGGCCCGC AACGAGTACC GGGTGCGCCT ATTGGAGATC CTGGCTGAAG CGGATGACCG CGAAGGCTTT GATCGGCAGG CGCAGGTATT GCGTGAACGG GTCGAGGGCG AGGACGATCC CCTCTGGCGC CAGGCGGAGC GCGTTGGCCG GGGCTATGCC CCTGACAATC CGCTGTTTGG TGGCGCTGAC CCGGATGGCG CCGATGATGG AGGGGCGGAA GCCACCGGCG GTGCGCTGCC CACGCTCGAC GAATCAGCGC CGGGGGGCGC CGATTTGGCG GAACCAGCGG AGGAACCTGC CCGGGAGGGT GAGGCCTTCG ATTTCACTCT CGACTTCCCG GAGCCCGATG AGCCGGGCCG GGAGACGGAT CAGGTAGCCC GACAGGGGGC GTCACGGGAA ACGGCGGCGC CGGAGACCCC TGCGACACCC GCCGGTGAGG AAGGCGCTGC CGGTGGGGAA GGGGGTGATT TCACCCTCGA CTTCGAAGTG GACGATAGCT GGCGGGAAGC GGCCGACTCG CGCCCGGCTG CGGATGCTGG CGAATCCCCG GGGGACGAGG CCTTCGACCT GGACCTGGGC GACCTGACCT TGGACGAGAG CGGGGCCGGG GATGCGACCG GGGAGGAACG TGGTGGCGGC GAACCGTCAC CAGGGGAGGC ACCGGACGCC GATCAGCCGC CGGCACCTCC CGCGGAGGCG CCGATGGAGG ACGCCGGTGG GGACGAGGAG ATCGCCACCA AGCTGGACCT CGCCCGGGCC TACGTGGATC TGGGTGATCC GGACGGTGCC CGAGAGTTGC TCAACGAGGT CCTGGAGGTC GGCACCCCGG CCCAGCAGGA GGAGGCGCGG AAGCTCCTGG GCGACCTCTG A
|
Protein sequence | MSFKRVFISA VAGLTFSSTV LALGLGTIER DSWLNQPLSA RIPLHSADTV SLEALQVTMA SQETFERAGL DRPPYLRDIR FEVIEDHTEG PHIRVYTRDP FREPFVDFLV ELNWPQGRLV REYTLLLDPP RDPAERPVAA TPPRTEPAEV EEQPLETASP TVRDGHYGPV AGGETLWSVA DRVRHQGVSA QQMALALFEA NPRAFVANDI NRLQAGATLT VPDAEGARAY SRNEAVRRFN ALAEGVAEEP AEEPPVAEAP DPETVDRQLQ ILAESERDEE ALASLLDGDV EPSEENLGAL REELLRARED QASLRSENEA LRERVSEMAQ EIERLERLLT LDVETGVLPM VPSEQPEGAV AEPPPERPAV VDEPEVEPEI TEETPAEDDE APVTAPPPAP SSPQPLAWLD GLSLPVALAG VGILIGLLAL LLIRRRRTET GVAAEEVPIQ QPPAREGVAA GTAAAASAVA DGDEPEPEPA PEPRSEPQDP LQVADELADR GDLEGARATL LTALNNVPAR NEYRVRLLEI LAEADDREGF DRQAQVLRER VEGEDDPLWR QAERVGRGYA PDNPLFGGAD PDGADDGGAE ATGGALPTLD ESAPGGADLA EPAEEPAREG EAFDFTLDFP EPDEPGRETD QVARQGASRE TAAPETPATP AGEEGAAGGE GGDFTLDFEV DDSWREAADS RPAADAGESP GDEAFDLDLG DLTLDESGAG DATGEERGGG EPSPGEAPDA DQPPAPPAEA PMEDAGGDEE IATKLDLARA YVDLGDPDGA RELLNEVLEV GTPAQQEEAR KLLGDL
|
| |