Gene Information |
Locus tag | Mlab_0998 |
Symbol | |
ID | 4795510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 1000041 |
End bp | 1001240 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640099661 |
Product | hypothetical protein |
Protein accession | YP_001030434 |
Protein GI | 124485818 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0306] Phosphate/sulphate permeases |
TIGRFAM ID | |
|
|
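The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
start_bp = 1000041
end_bp = 1001240


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # prints: 1200 399
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) rows of the record.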
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.06049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.558836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACCA TTACTTTTAT TGTTGTTATT CTCGGGATCA TCATCGCGCT CGCGTTCAAC TTTTCGAATG GACGAAACGA TGCATCCAAT GTAGTTGCGA CCGTGGTCGC AACTCGTGCC CTTTCTCCGA GAAACGCGAT CGTTTTGGCA TCGGTCTGTT GTTTTGCAGG TCCGTTCATC TTCTCGACCG CAGTTGCTAA AACTATTGGG AAAGGAATCG TAAATCCGGA AATCTTCACC CCCATTCTTC TTTTGATCGG TCTTTGCGGA GCTGTTTTCT GGGTGAATTT CTGCTCGCGG TCAGGAATCC CGGTCTCTTC ATCGCATTCT CTTGTAGGCG GGCTGATGGG AGCGGGAATT GCCGCCGGCG GTCTTTCGGT CGTGAACTGG CCTACATCGG AAATGGCTCT TGGGATGGTG TATTATCTGA TTCTCGGTGC AGTTATCGGC GCGATCGTAA TGCTGATTGT AGCACTCATC TTCAAGGACT CGGTAAAACT GTTTCTTCTG CTCGGGGCCG GGGCCGGAGC GATTTTGGCC GTGCCTATTG CGATGATTTT AGGGATTTTT ACCTTTTCCG GGCTTCTTGC GATCCTGCTT TTCATCTGCG TTTCGCCGAT TCTTGGAATG ATCGTATCAT ATGTATTCAC GACGCTTTTA ACGAGGATCA CGGCAAAGCG TTCAAATCAT CCGATGCTTT TGAACAAATG GTTCCAGCGT GCCCAGGTTC TCGCATCCGG TTTCCAGGCG GCAAGTCTTG GTGGGAATGA TGCTCAGAAT GCAATGGGTA TCATCCTTGC CATTCTGATC TCGACAGGTC TTGCGGCATC CACCGACGAC CTGCCTCTTT GGGTGATCCT TCTCGCAAGT TTGGCGATCG CCGCCGGCAT TCTCTCCGGA GGATGGCGGG TGATCAAGAA AATGGGTTCG GGAATCACGA GGATCCTGCC GTATCAGGGT TTTTCCGCAG CAGTTTCCGG CGGAGCCGTG CTTTCGTTTA TGACGTCGTT CGGTGTTCCC GTCTCGACGA CCCATGTCGC GAGCGGGACG ATCATGGGAA CAGGGGTCAC CCGCGGCGTC GGAGCAGTGA ACTGGAGTAC GGTTCGTCAG ATGGTCACAG CCTGGGTTAT CACGATCCCC TGCGCAGCAG TGGTCTCGTT TTTGGCATAT ATCATTCTCG CTTTCGTGTT CGGATTCTGA
|
Protein sequence | MDTITFIVVI LGIIIALAFN FSNGRNDASN VVATVVATRA LSPRNAIVLA SVCCFAGPFI FSTAVAKTIG KGIVNPEIFT PILLLIGLCG AVFWVNFCSR SGIPVSSSHS LVGGLMGAGI AAGGLSVVNW PTSEMALGMV YYLILGAVIG AIVMLIVALI FKDSVKLFLL LGAGAGAILA VPIAMILGIF TFSGLLAILL FICVSPILGM IVSYVFTTLL TRITAKRSNH PMLLNKWFQR AQVLASGFQA ASLGGNDAQN AMGIILAILI STGLAASTDD LPLWVILLAS LAIAAGILSG GWRVIKKMGS GITRILPYQG FSAAVSGGAV LSFMTSFGVP VSTTHVASGT IMGTGVTRGV GAVNWSTVRQ MVTAWVITIP CAAVVSFLAY IILAFVFGF
|
| |
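The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming the input contains only whitespace and the letters A, C, G and T. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tooling. It is shown on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence listed in the record.
sample = "ATGGATACCA TTACTTTTAT"
cleaned = clean_sequence(sample)
print(len(cleaned), gc_content(cleaned))  # prints: 20 0.25
```

Applying the same two functions to the complete Gene sequence field gives a value that can be compared against the GC content row of the record.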