Gene Information |
Locus tag | Mlg_0491 |
Symbol | |
ID | 4268359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 537341 |
End bp | 538624 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125231 |
Product | aminotransferase |
Protein accession | YP_741335 |
Protein GI | 114319652 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | [TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type |
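The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 537341
END_BP = 538624


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1284 427
```

Under these assumptions the result agrees with the Gene Length (1284 bp) and Protein Length (427 aa) fields.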
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.809579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAG CCCAGGAAAC CACCGACCGG CAGGGCATGC TGTCCCCCCT GCTCAAGCAG TCCAGTGGGG TGGTCGCCGA ACGGGGCGAG GGGGCCTATC TGTTTGATCG TGACGGCACC CGCTATCTCG ATTTCACTTC CGGCATCGGC GTCACCGCCA CCGGCCACGC GCACCCCAAG GTGGTGGCGG CGATCAAGGC CCAGGCGGAC AAGCTCCTGC ACGGCCAGTA CGCCATCGTG CGCCACCCCG GCATTATGGA ACTGGCCGAG CGGCTGGGCG CCTACATGCC CGGTCCCATT GATGCGCTGT TCTTCTCCAA CGCCGGCACC GAGGCCTGCG AGGCGGCGCT GCGCCTGGCG CGGCACGCCA CTGGCCGGCC CAATATCATC GTCTTCCACG GTGGGTTCCA TGGCCGCACC ATGGGCTCGC TGTCCATGAC CACCTCCAGT GTCGGCCTGC GGGCCGGCCT TCAGCCCATG ATGGGCGGCG TGGTGGTAGC CCCCTTCCCC AACACCTACC GCTATGGCTG GGACGAGGAG GCCGCCACCG ACTTCTGCCT GCGGGAACTG GACTACATTT TCGCCACCTA CAGCACCCCG GCGGAGACGG CTGGCGTGTT CATCGAGCCG GTGCAGGGCG AATCGGGCTA CGTGCCCGCC AACACCCGCT TCATGCAGGG CCTGCGCGAG CGCTGTGACC AGCACGACAT GCTCATGATC CTCGACGAGG TGCAGGCCGG CTATGGCCGC ACCGGCCGCT TCTGGGCCCA CAGCCACTTC GAGGTGCAGC CCGATGTGGT GGTGACCGCC AAGGGCCTGG CCAGCGGCAT GCCCCTGTCC GGCGTCGGTG CCCCCTCCGA GCTGATGGAG CGGGGCTGGG CGGGCTCCCA GGGCGGCACC TACGGCGGTA ACGCCGTCGC CTGTGCCGCG GCGCTGGCCA CCCTCGATGT CATTGAGGAA GAGGGCTTGG TGCACAATGC CGCCGAGCAG GGCGCTTACC TCAAGCAGCG GCTGAAGGAG GTCCAGGCGG AGTTTCCCGA GGTGGCCGAC GTGCGCGGCA TGGGGTTGAT GATCGGCACT GAGATGGTGG ACGCCGAAGG GCGCCCCGAC GGTGACCGGG CCGCGCGTAT CCTTAAGGCC ATGGAGAAGC GCAAGGTATT GATGATCCGC TGCGGGGCCT TCGGCGGGCA GGTGGTCCGC TGGCTGCCGC CGCTGATCGT CAGCCGCCAG CAGGTCGACA CCGCAGTCGA CACCTTCATA GAGGCCCTGC GGGAAACCGC CTGA
|
Protein sequence | MSAAQETTDR QGMLSPLLKQ SSGVVAERGE GAYLFDRDGT RYLDFTSGIG VTATGHAHPK VVAAIKAQAD KLLHGQYAIV RHPGIMELAE RLGAYMPGPI DALFFSNAGT EACEAALRLA RHATGRPNII VFHGGFHGRT MGSLSMTTSS VGLRAGLQPM MGGVVVAPFP NTYRYGWDEE AATDFCLREL DYIFATYSTP AETAGVFIEP VQGESGYVPA NTRFMQGLRE RCDQHDMLMI LDEVQAGYGR TGRFWAHSHF EVQPDVVVTA KGLASGMPLS GVGAPSELME RGWAGSQGGT YGGNAVACAA ALATLDVIEE EGLVHNAAEQ GAYLKQRLKE VQAEFPEVAD VRGMGLMIGT EMVDAEGRPD GDRAARILKA MEKRKVLMIR CGAFGGQVVR WLPPLIVSRQ QVDTAVDTFI EALRETA
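Both sequences are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch of how the Gene sequence field could be normalized, checked for GC content, and translated. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as the standard code; it does not handle alternative start codons or ambiguous bases such as N. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the Gene sequence field.
prefix = "ATGAGCGCAG CCCAGGAAAC CACCGACCGG"
print(translate(prefix))  # prints: MSAAQETTDR
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` would give a figure to compare against the GC content field.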