Gene Information |
Locus tag | Mlg_0043 |
Symbol | |
ID | 4270912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 47470 |
End bp | 48432 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638124769 |
Product | Rhs element Vgr protein |
Protein accession | YP_740891 |
Protein GI | 114319208 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
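The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 47470
end_bp = 48432
gene_length_bp = 963
protein_length_aa = 320

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not counted
# in the protein length.
codon_count = span_bp // 3
predicted_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("predicted protein length matches:",
      predicted_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.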
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGCC CGCAGATCGC CCGGGTGGTG GGCCCCGAGG GGGAGGCGAT CCATTGCGAT GAGCACGGCC GGGTCAAGGT CCGTTTCCCC TGGGACCGCT ACGCCGCCGA TGACGAGCAC GCCAGCGCCT GGCTGCGCGT CGCCCAGCCC TGGGCCGGGC CCGGCTACGG CGGGCTGTTC CTGCCCCGGG TGGGCCATGC GGTGATCGTC GACTTCCTGG CCGGCGACCC GGATCAGCCG GTGATCACCG GCCGGGTCTA CGATGGCCAC AACACCCCGC CCTATCCGCT GCCCGCGCAC AAGACCCGCA GCGTGCTGCG CAGCCGCAGC CAGGACGGTG AGGGCTACAA CGAACTGCAC TTCGAGGATG CCCGCGAGGC CGAGCGCATC CACCTGCACG CCCAGCGCGA TCTCGACCTG CACACCCGCA ACGACCGCTC CGAGACCATC GGCCGCCACA GCCACCTGGG CGTCCACGGC GACCGGCTCG CGGAGATCCA CGGCGACGAG CACCTCACCG TGCAGGGCGA GCGGCGCGAG CGCACCGGTG GGGATCAGCA TCTCAGCGTG GAGGGCACCC TGCATCTCAA AGCCGGTGAG GCCTGGCTGA GCGAATGCGG CCGGGAACTG CACGTCAAGT CGGGGCACAA GGCGGTCATC GACGCCGGTG CCGAGATCAC CCTCCAGGCG GGCGGCAGTT TCATCAAGGT CGATCCCTCG GGCATCACCC TCAGCGGCCC CGGCATCCGC ATGAACTCCG GGGGCAGCCC TGGATCGGGC TCGGGCCAGG CGGCCCTCGC ACCGGAGGTG CCGCGGGAGG CGGAGTTGGC ATCGCAAAGG GAGGTGGAGC CGGCCCCTAA CGTGCCGCGT ATTGAGCCGA TTCGGCAGGA GGCAGCCCTG CGCCAGGCGA TGGCTTTGAC ACAGCCCTGC ACACCCACTG GCGATGACGA GGGATCGGCA TGA
|
Protein sequence | MEGPQIARVV GPEGEAIHCD EHGRVKVRFP WDRYAADDEH ASAWLRVAQP WAGPGYGGLF LPRVGHAVIV DFLAGDPDQP VITGRVYDGH NTPPYPLPAH KTRSVLRSRS QDGEGYNELH FEDAREAERI HLHAQRDLDL HTRNDRSETI GRHSHLGVHG DRLAEIHGDE HLTVQGERRE RTGGDQHLSV EGTLHLKAGE AWLSECGREL HVKSGHKAVI DAGAEITLQA GGSFIKVDPS GITLSGPGIR MNSGGSPGSG SGQAALAPEV PREAELASQR EVEPAPNVPR IEPIRQEAAL RQAMALTQPC TPTGDDEGSA
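The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is illustrative only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It also assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code, differing only in the permitted start codons. The example runs on the first 30 bases of the gene sequence; paste the full sequence in its place to check the whole record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these amino acid assignments with the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons and drop a trailing stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First three 10-base blocks of the gene sequence from the record.
fragment = "ATGGAGGGCC CGCAGATCGC CCGGGTGGTG"
print(translate(fragment))
print(f"GC fraction of fragment: {gc_content(fragment):.2f}")
```

The translated fragment should correspond to the first block of the protein sequence listed above.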