Gene Information |
Locus tag | Mhun_0432 |
Symbol | |
ID | 3924659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanospirillum hungatei JF-1 |
Kingdom | Archaea |
Replicon accession | NC_007796 |
Strand | - |
Start bp | 500228 |
End bp | 503236 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637896072 |
Product | hypothetical protein |
Protein accession | YP_501914 |
Protein GI | 88601736 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
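The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record for Mhun_0432.
gene_len, protein_len = cds_lengths(500228, 503236)
print(gene_len, protein_len)  # 3009 1002
```

Under these assumptions the result matches the listed Gene Length (3009 bp) and Protein Length (1002 aa).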
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.259517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAA CCGATAAGAG ATTTAATATT TATGGGATAA GAATATTACA ATTAATTTTT ATCCTTTTTT TTATTACGGG AATAACAGTT GGAGTAATGG ATAAAAAAAC GGTTCCAGAC CTGGTTAAAG AATCAGATCT TGTCGTCACT GGGACTGTTA AAGATATTGA AAGTAATTGG GATGATGAAA AACTGAATAT ATACACAATA ACGACAATTA TTGTTAATGA GGTTTTTCTT GGACCAGCAA TTCCTAAAGA AGAAATTAAA ATACTTTCTA AAGGTGGAAC TGTTAATGAA ATTACGCAAT GGGTGGAAGA TGAACCATCC TTTTCAATTG GTTCTGATGT CGGTCTTTTA CTGAATTATC CATTAAAAAA TCAGTATTTA ACCAACAATA TGTCAGATGA AATTTGTACT GTGACTGGTA ATTTCCAGGG AGTTTTCTCT ATACAAAATA ATAAAGTTGA TTCTGAAGAG CATAACGAAT ATGGAAGATA TTCTATTGAC GAATTCAAAT TAATTATTGA AGAAGCAATT CAAGGTAGAG AACATATAAT TTCGTCAAAA AAACTCTTTG TTAAATCTGG ATATGAGGAG AAAAAAAAAC TTGAGACTTC TTCACCAATG ATACAAAGTG TCACTCCTTC TACTGCTTCT GCAGGAACAG ATACTATTAT TACTATAAAA GGAACTGGTT TTGGAACTAA ATCAAGTAGA GACTCAAATG CGGACGTTTT CTTTCTTTAT AGATTTAATG GACAAGACCA AACACTAATA TGGGCTTCTG GATTTCCTCA CTTTTCAATA AATGAATATG ATATTGTCTC ATGGACTGAT TCTCAAATTC GAGTTAAGGT TCCTGTTGGA AGGACTCAAG ATAATTATGA TGGTGGAGCA TCAAGTGGAT ATGTTGGAAT CCTGACGGAT AATGGTGAAG TTTCTAACGT TGCAGATTTT TCAGTTACTT TTTCATATGG GAAGAGAAAA TGGCCCGTAA CTTCAGTAAA TTATTATATC AATCCCGGTA GTATAATTGG GTCACTTACG GCCATTCAAA ATGCGGCAAA TTCATGGAAT GGTCAGTCAT CATTCAAGTT TAATTATGCT GGTACATCTT CAACTACAAA ATCAGGTGCT GATGGTAAAA ATGTGCTTCT TTTTAGAGAC TTAGGTAGTT CTGGCCCTAT TGCTCAGGCA ACATATTGGT TTTCATCTGG CATTATATCT GAAGCAGATG TTGAATTTAA TACCTATTAT TCCTGGACTA CTGATACAGC ATCTGGTGGA TTGAAAAATA TTGAAGCAAT AGCCATTCAT GAACTGGGAC ATTGGCTAAA TTTAAAAGAT TTATACGGTT GGGTTCCAGG ATTTCCTTCA GATTTAAGCC CAATCATGAA GGTCATGTTT GGATATAATG GTGACGTTAT TGGTAATACA AATTTAAAAA CATTATCTGC TTCAGAAATT GCAGGAATTA AGTGGATTTA TGGGGGATCC GGTCCGACTC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC TCCAACTCTA ACACCGACTC CTACACCCAC TCCAACTCTA ACTCCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC TCCAACTCTA ACACCGACTC CTACACCCAC TCCAACTCTA 
ACTCCGACAC CTACACCCAC CCCAACTCTA ACACCGACAC CTACACCCAC TCCAACTCTA ACTCCGACAC CTACACCGAT TCCGGTCCCT GTTGTGACAG GTATTATACC AGAATCCGGT CAGGCCGGAT TGACCATTAA CTATACTGTC ACCGGCTCCA ATTTCGTTGA TGGTGCGTTC GTCCATCTGG TAAAAGAAGG GCAGACCAAT ATTACTTCAA CAGGAACACT CAAGGAAGGA AAACTGACCG GAATATTCAA CCTTCCTTTG GATGCCATAA CGGGGCCATG GAATGTCATC GTCGAGCAGG ACGAGTTATT CAGTAATGAT AATATCTTAT TCACCATTAC TCCGGCTCCG GCAGATACCC CGGTAATAAC CTCGATAAAT CCGGACTCAG GAGAGAAAGG TACGGAAATT ATCTATTCAA TCACAGGTGA GCACCTGCTG AATGGCGCAA TAGTAAACCT TACCCATGAA GGTGAGGAAA ATATTACGTC AATCGGATAT CTTATGGGAT CACAGCTCTT TGGCACATTC GTGATACCAA CGGATGCTCT CACCGGACCA TGGAATGCAT CGGTGAACCA GAACGGACTC TATAGTAACG ATGAGATTCA GTTCACCATC ACAGATGGTC CGGTTCATAT CCCGGTTGTT CATAATATTA CCCCTGATTC AGGAATGCAG GGTGAATCAA CCGATTACCT TCTTCAGGGA GAAAACCTTA CGGATGGTGC TCTGGTAAAC CTCTCCCATC CAGAACAGGT CAATATTTCA TCAACTGGAA ATCTCTCAGA AGGAAACCTG ACAGGGACCA TCGCTATCCC CGATGATGCT CTGCCCGGAC CATGGAATGT GACCGTGAAC CAAAGCGGCC TTACCAGTAA TGATAATGTT CAGTTCATCG TTCTGCCATC CGGGCCGTTC CCGGTCGTAC GCTCCATTGC CATGTCATTT GCAGCACCAG GAAAAGGGAG CGGATTTGTT GTTTATGGAG AAAATTTTGA GAACGGGGCG ATTGTAAACC TGTCACACTC AGGTGAGAAG AACATCACCG CTATAGGTGA ATTAGTAAAA GGGACTCTAA CCGGGACATT TAGTATACCT GAATCATGCA GACCTGGACT TTGGAATGTA ACGGTGAATG TTCGGGGAAA AGTCAGTAAT GACAATGTCC AGTATCCCAT TAAAGGGAAA CTCCGGTAA
|
Protein sequence | MGKTDKRFNI YGIRILQLIF ILFFITGITV GVMDKKTVPD LVKESDLVVT GTVKDIESNW DDEKLNIYTI TTIIVNEVFL GPAIPKEEIK ILSKGGTVNE ITQWVEDEPS FSIGSDVGLL LNYPLKNQYL TNNMSDEICT VTGNFQGVFS IQNNKVDSEE HNEYGRYSID EFKLIIEEAI QGREHIISSK KLFVKSGYEE KKKLETSSPM IQSVTPSTAS AGTDTIITIK GTGFGTKSSR DSNADVFFLY RFNGQDQTLI WASGFPHFSI NEYDIVSWTD SQIRVKVPVG RTQDNYDGGA SSGYVGILTD NGEVSNVADF SVTFSYGKRK WPVTSVNYYI NPGSIIGSLT AIQNAANSWN GQSSFKFNYA GTSSTTKSGA DGKNVLLFRD LGSSGPIAQA TYWFSSGIIS EADVEFNTYY SWTTDTASGG LKNIEAIAIH ELGHWLNLKD LYGWVPGFPS DLSPIMKVMF GYNGDVIGNT NLKTLSASEI AGIKWIYGGS GPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPTPTL TPTPTPIPVP VVTGIIPESG QAGLTINYTV TGSNFVDGAF VHLVKEGQTN ITSTGTLKEG KLTGIFNLPL DAITGPWNVI VEQDELFSND NILFTITPAP ADTPVITSIN PDSGEKGTEI IYSITGEHLL NGAIVNLTHE GEENITSIGY LMGSQLFGTF VIPTDALTGP WNASVNQNGL YSNDEIQFTI TDGPVHIPVV HNITPDSGMQ GESTDYLLQG ENLTDGALVN LSHPEQVNIS STGNLSEGNL TGTIAIPDDA LPGPWNVTVN QSGLTSNDNV QFIVLPSGPF PVVRSIAMSF AAPGKGSGFV VYGENFENGA IVNLSHSGEK NITAIGELVK GTLTGTFSIP ESCRPGLWNV TVNVRGKVSN DNVQYPIKGK LR
|
| |