Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2501 |
Symbol | |
ID | 4270820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2841443 |
End bp | 2842669 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638127259 |
Product | hypothetical protein |
Protein accession | YP_743331 |
Protein GI | 114321648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGCA CATGCCATAC CAAAAAACCG GTTTGCCTGG GGGTGGCCAC CGCCGCACTG CTGGGGCTGA TCATCTCACC GGCGTGGGCG GAGACCACCG TCGATTTCGG CGGCTACGTG AAGATGGATG CCCACTTCAG TGATGTGACC AACCGTGCCG CCAACGACCG CAGCGAGGCC TTCATGATCC CCGCCCTGAT CCCGCTGGAC GGGCAGGAGG ACTCACGGAA TGTCACCCGC TACAGCGTGC GTGAGAGTCG GGTCAACCTG CGCACGCAGA CGCCCACCGG CCTGGGGGAT CTGACCACGT TTCTCGAAGT GGACTTTCTC GAGGATGCCG CCATCGACCG CAACCGCCTG GTGGGCAACC AGCCCCCGCG GCTACGGCAC GCCTTCGGTC AGCTGGGCAA CTGGCTGGCC GGGCAGACCT GGGGCACTTT CTACAACGTT ACCACCAAAC CGGAGACCCT GGATTTCGTC GGGCCCGCCG GCACGGTGTT CAATCGCAAT ATCCAGGTGC GCTACACCCT GCCGTTGGAG CAGGGCAACA GCCTGATGCT GGCGGTGGAA CAGCCCTTCA CCACCCTGGC CTCCGAGGCG ACGTTGGGTG AGGCCGATCC GGGCGATGCC ATCCGCAACG CCCGGGATGA CCGCTGGCCG GAGTTCGTCG CCCGCTACAA CGTCTCGGGG GATTGGGGCC ACGGCTCCCT GGCTGGCGTC GCCCGGAATC TCCGGGTCGA TCGCAGCACC TCGCGGGAAC TCGGCGCCGA CGTGGACGAC GACGAGTGGG TGGGCGCACT CAGCCTGACC GGCGTGGTGA AGGCCGGAGG GCGCAATGAT GTCCGCTTTC AGCTCAACTA CGGCGACGGA CTCGGCCGTT ACCTCGGCCT GAACGCCTTT CCCGATGCCT TTATCGACGA CCAGGGCAAT CTGGACTCCT TGAGCATCTG GGGTGGGTAT GTGTCCTATC GGCACTGGTG GAATCAGACC CTGCGCAGCA GCCTGGTCTA CAGCCTGGCC AAGGCCGACA ACCCCAGCAG CGCCCCGGAG ACGGCCAACG AGCAGATCCA GTCGGTGCAC CTCAACCTGA TCTATACACC GGTGCAGAAC GTGGACGTGG GCGTGGAGTA CATCTGGGCC GAGCGGGAGA TCGAGGGCGA GGACGCCTAT GGTGAGGACA GCGGCGAGCT GAACCGCGTG CAGGTCTCGG CGAAGTACAG CTTCTGA
|
Protein sequence | MRSTCHTKKP VCLGVATAAL LGLIISPAWA ETTVDFGGYV KMDAHFSDVT NRAANDRSEA FMIPALIPLD GQEDSRNVTR YSVRESRVNL RTQTPTGLGD LTTFLEVDFL EDAAIDRNRL VGNQPPRLRH AFGQLGNWLA GQTWGTFYNV TTKPETLDFV GPAGTVFNRN IQVRYTLPLE QGNSLMLAVE QPFTTLASEA TLGEADPGDA IRNARDDRWP EFVARYNVSG DWGHGSLAGV ARNLRVDRST SRELGADVDD DEWVGALSLT GVVKAGGRND VRFQLNYGDG LGRYLGLNAF PDAFIDDQGN LDSLSIWGGY VSYRHWWNQT LRSSLVYSLA KADNPSSAPE TANEQIQSVH LNLIYTPVQN VDVGVEYIWA EREIEGEDAY GEDSGELNRV QVSAKYSF
|
| |