Gene Information |
Locus tag | Mlg_1884 |
Symbol | |
ID | 4269739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2146636 |
End bp | 2147607 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126640 |
Product | hypothetical protein |
Protein accession | YP_742718 |
Protein GI | 114321035 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.65449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.844493 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTCC GGCACCGGCT CCACTCAGGC CTGCTGCGGC TGCCCCGACG CCGTGGGGAG AGCACCTCCG ACCGCCTTAC CCTCGATTAT CGGCGGATCT TCATCCTCCC CAGCCGCTAC GGCCTGTTTT TGACTTTGGT GGCGGCTCTG GTCTGGCTCG GCGGGGTGAA TTACACGAAT AACATGGTGC TCCTGCTCTC CTTTCTGCTG ATCGGGCTGA TCGTGGTGAG CATCCACCAT ACCTTCCGCA ACCTGCACCG CCTCATGCTC CTGGCCGGGC CGGCGGACGC CGTCTTTGCC GGCCAGACGT TGCACTTCCC GGTAACCGCC CACAACCCCA CCCGACATGG CAAGCCGGCG CTCACCCTGG TGGGCGGCGA GGGCCAGCAG ACAGCCGATC TGCCCCCCGG CGGGAGCGTA CGCTGGTGGC TGCCGGTGCC CACCCACCGG CGCGGCTGGC AGACCCTGCC GCGGTTCCGA GTGCACAGTC GGTTTCCCAC CGGGCTCTTC GTGGCCTGGG CGCTGCCCGC ACCGCGCCAG CGCGCGCTGG TCTATCCCAC CCCCGAGCAC GGTGCGGTGC CCCCACCCCC ACACAGCGCC GGGGGCCACC AAGGCGAACG TCACGGTGAG GGGGACGACG ATTTCCGTGG CCTGCGCCGC TACCAGGCCG GCGACCCCAA GGGTCACATC GCCTGGAAGC GGCTGGCCCG CGGCGAGGAG CACCTGCACA CCAAGCAATT CTCCGGCGCC GCCGGCGCCC CGCGCTGGCT GGATGAGCGG GCCATTGACC CCCACCTGGA CCCGGAGGCG CGCCTTGCCC GGCTCTGTCG CTGGGTGGTT GACCTGGACC GGGCCGGCCA CCCCTACGGC CTGCGGCTCG GTCACCTGCG GATCGCGCCG GGCCGGGGTG AGGCCCACCG CCATGCCTGC CTGCGTGCCC TGGCCCTGTA CGACCCGGAC GCCCGTCAAT GA
|
Protein sequence | MSVRHRLHSG LLRLPRRRGE STSDRLTLDY RRIFILPSRY GLFLTLVAAL VWLGGVNYTN NMVLLLSFLL IGLIVVSIHH TFRNLHRLML LAGPADAVFA GQTLHFPVTA HNPTRHGKPA LTLVGGEGQQ TADLPPGGSV RWWLPVPTHR RGWQTLPRFR VHSRFPTGLF VAWALPAPRQ RALVYPTPEH GAVPPPPHSA GGHQGERHGE GDDDFRGLRR YQAGDPKGHI AWKRLARGEE HLHTKQFSGA AGAPRWLDER AIDPHLDPEA RLARLCRWVV DLDRAGHPYG LRLGHLRIAP GRGEAHRHAC LRALALYDPD ARQ
|
| |
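The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal Python sketch and is not part of the source record. The function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values taken from the record for Mlg_1884.
    length = gene_length(2146636, 2147607)
    print(length)                           # 972
    print(expected_protein_length(length))  # 323
    # First 10-base block of the gene sequence only, as a small demonstration.
    print(gc_percent("ATGTCCGTCC"))         # 60.0
```

Passing the full string from the Gene sequence row to `gc_percent` gives a value that can be compared against the reported GC content of 71%; that comparison is left to the reader and is not asserted here.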