Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1027 |
Symbol | |
ID | 4269768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1171847 |
End bp | 1173487 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125779 |
Product | hypothetical protein |
Protein accession | YP_741870 |
Protein GI | 114320187 |
COG category | [S] Function unknown |
COG ID | [COG4425] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.847677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGAG TCACCCGAAT CTTCGGGCAG TTCTCCACGG CGGGGCTGCT GTTGGGCACC CTGTTTTTCG CCTTCTCGCT GACCCCGAGC CTGTTGCCCC GGCCTTTCCT GGTGCAGGGT GTCATCTCCG GGCTGTCGTT CGCCGCAGGG TATGCCCTGG GCTTCGCCGG CCAGTGGCTG TGGGCCTATT TGGAATTGCC CGCCCCCCGC GCCCGCCTGG GCAGTGCCGT GAAACTGCTC GCGACCCTGG CCTGCGTGGT GATCGCGGGC CATTTCCTGG CCCGGGCCTC GGAGTGGCAG AACTCCGTGC GGACCCTGAT GGGGGTGGAG CCTGTGAGCG GGATCCGGGC CTACAGCATC GCGTTGATTG CCCTAGCGGT CTTCGCCCTG TTGTTGCTGC TGGCACGCCT TTTCCGTCAC ACCTTCCTGC TGCTCTCGGC GCGGCTGCAG CGCCATGTGC CCCGCCGGGT GTCGCACGTG GCCGGGATCG GCGCCGCGCT GCTGCTGTTC TGGTCGGTCA TCGACGGGGT GATCTTCACC CTGGGCCTGC GCGCCGCCGA TAACTCCTAC CAACAAGTGG ATGCCTTGAT CCAGGATGAC TTGGATCCGC CGGAGGACCC GATGCGTACC GGCAGCGCCG CCTCCCTCAT CACCTGGGAG GAGTTGGGCA GCCGCGGGCG CCGGTTCGTC AGCAGCGGGC CGACAGCGGA GGACCTGCGC TGGTTTCACG GCGAACCGGT GCCGGAGCCC ATTCGGGTCT ATGTGGGGTT GAACGCGGCG GAGACCCCGG AGGCCCGGGC CGAGCTGGCC CTGGAGGAGC TCAAGCGGGT GGGTGGTTTC GACCGCTCGG TGTTGCTGAT CGCAACCCCC ACCGGGCGGG GTTGGGTGGA CCCGGCCGCC CAGGAACCGG CCGAGTACCT GCACCGTGGC GATATCGCGA CGGTGACCGC GCAGTACTCC TACCTGCCCA GCCCCTTGTC GTTGCTGGTG GAGGGTGACT ACGGGGTGGA GACCGCCCGC GCCCTGTTTC AGGCCGTGTA CGGGCATTGG AGCCGCCTAC CGGAGGACGA GCGGCCCCGC CTCTATCTCC ACGGTCTGAG CCTGGGGGCG CTGAATTCCG ATCGCTCCTT CGATGTCTAC GACATCATTC AGGATCCGTT CGACGGGGCG CTCTGGAGCG GTCCCCCCTT TCGCAGCGAG ACCTGGCGTA CCGTCACCCG CGGCCGGGAC GCCGGATCAC CGGCCTGGTT GCCCCGGTTC CGTGACGGCT CGGTGGTCCG ATTCATGAAC CAGTACGAGG GCCTGGAGGA TCAGGGTGAT GAGTGGGGGC CCTTCCGGAT CGCCTTCCTG CAGTATGCCA GCGACCCGGT GACGTTCTTT GATCCCGCCG TGCTCTATCG TGAACCGGAA TGGATGCGGG AGCCGCGTGG CCCGGATGTC TCCACCGAAC TGCGCTGGTA CCCGGTCGTC ACGATGTTGC AGCTGCTGGC CGATATTGCG GTGGGAGGGG CACCCCGGGG GCATGGCCAT GAGATCGCCG CCGAACACTA TGTCGATGCC TGGGTGGCGC TGACCGAGCC GGAGGGCTGG TCTGAGTCGG AGCTGGACCG GCTGCGCGGC CGGTCCCGGC CGGAGGAGTG A
|
Protein sequence | MRRVTRIFGQ FSTAGLLLGT LFFAFSLTPS LLPRPFLVQG VISGLSFAAG YALGFAGQWL WAYLELPAPR ARLGSAVKLL ATLACVVIAG HFLARASEWQ NSVRTLMGVE PVSGIRAYSI ALIALAVFAL LLLLARLFRH TFLLLSARLQ RHVPRRVSHV AGIGAALLLF WSVIDGVIFT LGLRAADNSY QQVDALIQDD LDPPEDPMRT GSAASLITWE ELGSRGRRFV SSGPTAEDLR WFHGEPVPEP IRVYVGLNAA ETPEARAELA LEELKRVGGF DRSVLLIATP TGRGWVDPAA QEPAEYLHRG DIATVTAQYS YLPSPLSLLV EGDYGVETAR ALFQAVYGHW SRLPEDERPR LYLHGLSLGA LNSDRSFDVY DIIQDPFDGA LWSGPPFRSE TWRTVTRGRD AGSPAWLPRF RDGSVVRFMN QYEGLEDQGD EWGPFRIAFL QYASDPVTFF DPAVLYREPE WMREPRGPDV STELRWYPVV TMLQLLADIA VGGAPRGHGH EIAAEHYVDA WVALTEPEGW SESELDRLRG RSRPEE
|
| |