Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC5_0396 |
Symbol | |
ID | 4928243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C5 |
Kingdom | Archaea |
Replicon accession | NC_009135 |
Strand | - |
Start bp | 356322 |
End bp | 358376 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640165900 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001096926 |
Protein GI | 134045440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0162411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TATGTACATT ACTATTAGTA TTTGCACTTG TTTCCGGCTT GAATATTGCT TATGCAGATT CTGCTCCTAG CTTACCCCAT ACAATTTATG GGGACGTATC AATTAATGGA CTTCCTGCAA CAGGAACGTT AAAAGTACTT GTAAACGGAG TGGAAAGTGA GCAAGTGCAG GTTAATGATG GAGAGTTTGG TAAGGGATTA TTTGATCCCA AACTAGTTGT TAGTGGAGCT TCAGGGGATG AACTTACATT TTCATTTGAA GCTGAAAGTT ATACAATAAA TCCAGCTTAT AATATATACC TCGTAGATTC AGCTCAGTAC GTTTCAGAAA TTGACTTCGC ATCTGGAGGG TATACTCAAG TATTATTAGA ATTCACCGGA ACGGGTGATA CCGGTGATAC TGGAGATACC GGTGATACTG GAGATACCGG TGATACTGGA GATACCGGTG ATACTGGAGA TACCGGTGAT ACTGGAGATA CCGGTGATAC TGGAGATACC GGTGATACTG GAGATACCGG TGATACTGGA GATACCGGTG ATACTGGTGA CAGTGGATCA ATGCCACTTA ATCCCGAATT ATTCTACGGT ATCGCAACAA TTGGGGAAAC CAGTGCTTCA GGAACTTTAA ACGTTTATGT TGATGACGTA CTTCAAGATT CAATTGAAGT TCAAAACGGA TTATTTGGGG GCTCAGGACC GCTTTCAGAA AAACTCGTCG CAACAGGATA CGTTGGAGAA TCCAATGAAG TTAAATTCAC CTTAGTTTCT GGAGAGGAAA CTTATTCAAG TTTTATTGCG GAAATCGGTG AAGAAACTTA CACAGATGAA CTTCCATACG TTGAAGGGGT ACAAAACATT GATATTGAGT TTTCAGAAGT TACTGGAGAT GCCGGTGATA CTGGAGATAC CGGTGATACT GGAGATACCG GTGACAGTGG ATCAATGCCA CTTAATCCCG AATTATTCTA CGGTATCGCA ACAATTGGGG AAACCAGTGC TTCAGGAACT TTAAACGTTT ATGTTGATGA CGTACTTCAA GAGTCAATTG AAGTTCAAAA CGGATTATTT GGAGGGCCGT GCCCACTTGC AGAAAAACTC GTCGCAACAG GATACGTTGG AGAATCCAAT GAAGTTAAAT TCACCTTAGT TTCTGGAGAG GAAACTTATT CAAGTTTTAT TGCGGAAATC GGTGAAGAAA CTTACACAGA TGAACTTCCA TACGTTGAAG GGGAAGCGTA TTACATGGAA ATAAGTTTTT CTGAATCAGC AAATACGGAA GATAGCAATA GTAATTCAAA TACTAATGAA ACAAATAATT CGTCAATGCC ACTTTATCCA GAATTATTCT ACGGACTCGT TTATCTTGAT GATACATTAC CTTCCAGCAC ATTGAATGTG TATGTTGACG ATGTACTTCA AGATTCAATT GAAATTGAAA ATGGAGTATT TGGGGGAGAA GGGCCTCTTG ATGATAAACT GACTGCGACA GGTTACGAAA GCAGTAGTAA CGTAGTCACG TTTTCACTAG TTTCTGGAGA GGAAACTTAT TCAAGCTTTA CTGCAGAATT GTCAAATGCT ACTTACGAAA ATGAAGTTCC TTACGATGAA GGAGTACATT ACGTAATAAT CACATTCTCA AGTAAAGCCA CTGAAACTGG GGACTCTGGA AGTACAGGAG GTGGCGGATC TAGCGGAAGT AGTTCAGATG ACTCTTCATC GACAGTTATT ATCAGTTCAG ATTCGTCAGA AACTTCAGCC ACTACCAAAA ATTCAGATTC AGGAACTTTG ATGAAGACCA CTTCATCAGC AAATCCTGGT GAAACTTCGG AAGATACCGA ACAGACCACT GCGAAAAATT CACAGGATGT AACTTCAGAT AGTGAAACAT ACGATAATGA AACAGGAGTT GTGTTACAGC AGGAAAGCCC ACTTGGTGGA ATAAACATTT ACCTTGCTAT GGCTGCAATA TTATTGATAT TGATTGCACT AGCTGCAGCA TGGTATCAGT CAAGAGAAAA ACCAGAAGTT TTACCTCAGC CATAA
|
Protein sequence | MKKICTLLLV FALVSGLNIA YADSAPSLPH TIYGDVSING LPATGTLKVL VNGVESEQVQ VNDGEFGKGL FDPKLVVSGA SGDELTFSFE AESYTINPAY NIYLVDSAQY VSEIDFASGG YTQVLLEFTG TGDTGDTGDT GDTGDTGDTG DTGDTGDTGD TGDTGDTGDT GDTGDTGDTG DTGDTGDSGS MPLNPELFYG IATIGETSAS GTLNVYVDDV LQDSIEVQNG LFGGSGPLSE KLVATGYVGE SNEVKFTLVS GEETYSSFIA EIGEETYTDE LPYVEGVQNI DIEFSEVTGD AGDTGDTGDT GDTGDSGSMP LNPELFYGIA TIGETSASGT LNVYVDDVLQ ESIEVQNGLF GGPCPLAEKL VATGYVGESN EVKFTLVSGE ETYSSFIAEI GEETYTDELP YVEGEAYYME ISFSESANTE DSNSNSNTNE TNNSSMPLYP ELFYGLVYLD DTLPSSTLNV YVDDVLQDSI EIENGVFGGE GPLDDKLTAT GYESSSNVVT FSLVSGEETY SSFTAELSNA TYENEVPYDE GVHYVIITFS SKATETGDSG STGGGGSSGS SSDDSSSTVI ISSDSSETSA TTKNSDSGTL MKTTSSANPG ETSEDTEQTT AKNSQDVTSD SETYDNETGV VLQQESPLGG INIYLAMAAI LLILIALAAA WYQSREKPEV LPQP
|
| |