Gene Information |
Locus tag | Mlg_0842 |
Symbol | |
ID | 4270779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 956389 |
End bp | 958446 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125594 |
Product | hypothetical protein |
Protein accession | YP_741686 |
Protein GI | 114320003 |
COG category | |
COG ID | |
TIGRFAM ID | |
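The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record for Mlg_0842.
gene_length, protein_length = cds_lengths(956389, 958446)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the Gene Length (2058 bp) and Protein Length (685 aa) fields.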
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.456068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTGA ACCTGGCGCC TCACATTGCC AATGCAGGCG ACGCCCATAT GCCTGAGTGG CCGGGCGAAG CCGGTGACGC CGAGCGTAAC GACCTGGAGT CGCCCGCCGA GCTCTATTCG CAGCATCGCG AAGCCATAGA GCAGGTCACT GCCCGTTCAG GCGGCCACAC GGGCTTCATT GGCGGGGCGC TGGCGCCCAA CGGCATCGTC GACGTGATCC GCCGGCAGGC GATGGAAGAC TTCCTGGCCT ATGCCACCCC GATGCAGGCG CAGTGGCAGA GCGAGTATCA CCGTATCGGG GCCGACCTGG CCGCCATACT GCCCACCTGG CACACCCAGG CCCTGCTGCT CGACCGCGAA GAGGAGCATC ACATCCTGCT GACCTGCCTG CTGGAAAAGC AGGCCGTCGA GACCCTGCTG GCCTGCGGGC AGGAGGACTT CCTGTCGAGC TACTACGCCG GCGACGACCC GGTACCGGCA CACCTGATGC ATTACGTCCC CACCGCGTCC TTCGTGGAGG GATTCCTTTC CCAAAACACT GGCCTCCAGA AAGCGCTCAC CCAGGCGTCC GCACTGATGG GCGCCCAGGG CGCACTCAGC CGCTACCAGC AGTGGCGGGG CGAGGTTGAA CATCAGACCG GGCTGCGCTT TCGCAGCGTC GAGGGCCTCT CCGACGAAGC CCGAACCGCC ATCGCGGGCG AAGTGCAAAT CAAGGAGAAG CTGCTGGGGC AGGCGGTGCT CGGGACGTTG CTCGATGACG TACAGGATGT CGACCTGGGG CAGCGCATCA CTACCCTGGC TTCGCGCTTG CCCGATGGGC AACGGCTGAT GTTCGCGGAA CGGCTGGGCC TGCTGGAGCT GGGCTGGGAC ATTCCCGATC AATCGGTACT GGGCCGCATC CAGCAGGCGC TGGACGACAC CGATACGGCC ATGACCCGGC TGACCGCGCT GGAGCGGGAG CTCGAGCAAC TCCGGCGGAA ACGCCAGCAG GAGATGGCGC GCGCGTCAAG GCGCGGTACC CGCCAGGCCC ACCGCCGGGC GGCCGACCAA TTCAACGCGC GCAAGATCCG GGAGGCCCAG GCAGACATCA ACCGGCACAA GCGCCTGCTG GGCGAGGCAT TCGACGCCCT GGCCGAGACC CGTGCGGCCG CCGGCAGCCC GGCGCAACTG GAGCGCTGGG CCAGGGTAGC CAATGGGGCG ATGGGGGCCG TGGCCCTATT GGGTGGCCTT TCAGGTGCTC TCGAGGTTTT CAAGCAATCT CGGCGTATCG ACCGCGCAGA CACCGACGCC GAGCGACTGG CCTCACAGGT TGCCTTTACG GGCGCGGCCG GAGTCGCCCT GGGTGGCCTA TCAATCGGCA TCATGAGTAT GGTCGGTCGA ACCCTGGGCA AGCCGGCCGT CGCCTGGCGA CTGCTGCTGC TCAAATTCGC CGGCCCCGCC GGCTGGGTGG TGGCGGTTGG CACCGCCCTG CTGATCATCG GCGAGGTGCT GGCCAATCGC TTCTCGTTGA GCCCTGTGCA GCGCTGGTGC CAGCGCAGTC ACTGGGGGCG AGAAGATCAG GGCTGGGATC GCGAGGCCCA CGAGCGGGAA CTGGCCCGAC TTGGCGATAC CGATCTCACG GTGGAACGGC AGGGGCAGGC CGAGCCCCAT GGCGGCCCGG GGCCCGGGCC GGCAGGCACC GACCTCGCCA TACGCATTGG CTTGCCCGGG CTTGACGCCC CCAATGCGGA AAACCTCGCG CTGGGCCTCT GGGGCGTCAC CCCTCGCCTC AAGGAAATGA CCCGAGACTT TCTCGAACAT 
GCCGAGCTCG AAACCCGGGG CTCGAGCTAT GCCCTGCACT ACCATTTCGA TCCCGAAACA TTGGCCGAAT GCCACGAGTT CCGCCTCGTC ATCCGCACGA AGGGCCCCGA AGCATCCACC ACCCGGGTCT TCCAGTTGCA TCGCCGCGGC ACATCGCTCT CCGATGAGTG GAGGGAGATC TCCGCCCTCG GCGATCGTTT CCTCACGCGG TACCAAGTGG GCAACTGGCC GGACATGCCC CTGACGCCCT GGCCGTGA
|
Protein sequence | MSLNLAPHIA NAGDAHMPEW PGEAGDAERN DLESPAELYS QHREAIEQVT ARSGGHTGFI GGALAPNGIV DVIRRQAMED FLAYATPMQA QWQSEYHRIG ADLAAILPTW HTQALLLDRE EEHHILLTCL LEKQAVETLL ACGQEDFLSS YYAGDDPVPA HLMHYVPTAS FVEGFLSQNT GLQKALTQAS ALMGAQGALS RYQQWRGEVE HQTGLRFRSV EGLSDEARTA IAGEVQIKEK LLGQAVLGTL LDDVQDVDLG QRITTLASRL PDGQRLMFAE RLGLLELGWD IPDQSVLGRI QQALDDTDTA MTRLTALERE LEQLRRKRQQ EMARASRRGT RQAHRRAADQ FNARKIREAQ ADINRHKRLL GEAFDALAET RAAAGSPAQL ERWARVANGA MGAVALLGGL SGALEVFKQS RRIDRADTDA ERLASQVAFT GAAGVALGGL SIGIMSMVGR TLGKPAVAWR LLLLKFAGPA GWVVAVGTAL LIIGEVLANR FSLSPVQRWC QRSHWGREDQ GWDREAHERE LARLGDTDLT VERQGQAEPH GGPGPGPAGT DLAIRIGLPG LDAPNAENLA LGLWGVTPRL KEMTRDFLEH AELETRGSSY ALHYHFDPET LAECHEFRLV IRTKGPEAST TRVFQLHRRG TSLSDEWREI SALGDRFLTR YQVGNWPDMP LTPWP
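The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; the two tables differ only in which codons are permitted as start codons. The function name `translate` is hypothetical. The sketch does not handle alternative start codons, which is not an issue here because this gene begins with ATG, and it needs no reverse complement because the gene is on the + strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGTCGTTGA ACCTGGCGCC TCACATTGCC"))
```

Applied to the first 30 bases, this yields the first ten residues of the listed protein sequence; passing the full gene sequence should reproduce the whole protein under the same assumptions.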
| |