Gene Information |
Locus tag | MmarC6_1479 |
Symbol | |
ID | 5737672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C6 |
Kingdom | Archaea |
Replicon accession | NC_009975 |
Strand | - |
Start bp | 1379998 |
End bp | 1382151 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641283980 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001549524 |
Protein GI | 159905862 |
COG category | |
COG ID | |
TIGRFAM ID | |
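The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed Gene Length) and that the final codon of the CDS is a stop codon that is not counted in the Protein Length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1379998
end_bp = 1382151
listed_gene_length_bp = 2154
listed_protein_length_aa = 717

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print(gene_length_bp == listed_gene_length_bp)
print(gene_length_bp % 3 == 0)
print(expected_protein_length_aa == listed_protein_length_aa)
```

Under these assumptions all three checks agree with the table: 2154 bp is 718 codons, and dropping the stop codon leaves 717 amino acids.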
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.289213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TATGTACATT ACTATTAGTA TTTGCACTTG TTTCCGGCTT GAATATGGCT TATGCAGATT CTGCTCCTAG CTTACCCCAT ACAATTTATG GGGATGTATC TATTAATGGA CTTCCCGCAA CGGGAACGTT AAAAGTACTT GTAAACGGAG TGGAAAGTGA GCAAGTGCAA GTTACTGATG GAGAGTTCGG TAAGGGATTA TTTGATCCCA AACTAGTTGT TAGTGGAGTG TCAGGGGATA AACTTACATT TTCATTTGAA GCTGAAAGTT ATACAATAAA TCCATCTTAT AATATATATC TTGTAGATTC AGCTCAGTAC GTTTCAGAAA TCGACTTCGC ATCTGGAGGA TATACTAAAA TATTATTAGA ATTCACAGGA ACGGCTGACA GTGGTGATAC TGGAGATACC GGTGACAGTG GTGATACTGG AGATACCGGT GACAGTGGCG ATACTGGAGA TACCGGTGAC AGTGGCGATA CTGGAGATAC CGGTGACAGT GGCGATACTG GAGATACCGG TGACAGTGGC GATACTGGAG ATACCGGTGA CAGTGGCGAT ACTGGAGATA CCGGTGACAG TGGCGATACT GGAGATACCG GTGACAGTGG TGATACTGGA GATACCGGTG ACAGTGGCGA TACTGGAGAT ACCGGTGACA GTGGATCAAT GCCACTTAAT CCAGAATTAT TCTACGGTTT CGCAACAATT GGGGAAACCA GTGCTTCAGG AACTTTAAAC GTCTATGTTG ACGATGTACT TCAAGATTCG ATTGCAGTTC AGAATGGATT ATTTGGGGGA TCAGGACCGC TTGCAGAAAA ACTCGTCGCA ACTGGATATG TTGGAGAAAC CAATGAGGTT AGGTTCACCC TAGTTTCTGG AGGAGAAACT TATTCAAGCT TTAGTGCGGA AATAGGTGAA GAAACTTACA CAGATGAACT ACCATACGTT GAAGGAGTAA AAAACATTGC GATTGAATTT TCAGGAAGTA CAGGAATCTC AGGCGATACT GGAGATACTG GAGATACCGG TGACAGTGGA TCAATGCCAC TTAATCCAGA ATTATTCTAC GGTTTCGCAA CAATTGGGGA AACCAGTGCT TCAGGAACTT TAAACGTCTA TGTTGACGAT GTACTTCAAG ATTCGATTGC AGTTCAGAAT GGATTATTTG GGGGATCATG CACACTTGCA GAAAAACTCG TCGCAACTGG ATATGTTGGA GAAACCAATG AGGTTAGGTT CACCCTAGTT TCTGGAGGAG AAACTTATTC AAGCTTTAGT GCGGAAATAG GTGAAGAAAC TTACACAGAT GAACTACCAT ACGTTGAAGG GGAAGCGTAC TACATGGAAA TAAGTTTTTC TGAATCAACA GGTGCGGTAG ATAACAATAG TAATTCAAGT AATGATGAAA CAAGTGATTC GTCAATGCCA CTTTATCCAG AATTATTCTA TGGACTCGTG TATCTGGACG ACACCTTGGC ATCCAGTACA TTGAATGTAT ATGTTGACGA TGTACTTCAA GATTCAATTG AAATTGAAAA TGGAATATTT GGAGGAGAAG GACCTCTTGA TGATAAACTG ACTGCAACAG GTTATGAAGG TAATAGTAAC GTAGTCACGT TTTCACTAGT TTCTGGAGGA GGAACTTATT CAAGCTTTAC TGCAGAATTG TCAGATGCCA CTTACGAAAA TGAAATTCCT TATGATGAGG GAGTACATTA CGTAATACTC ACATTCTCAA GTGAAGCTAC TGAAACAGGG GACTCCGCAA GTACTGGAGG TAGCGGATCT 
GGAGGAAGTA GTTCAGGTGG CTCTTCGTCA TCAGTTATTA TCAGTTCAGA TTCATCAGAA CCATCAGCCA CTACCAAAAA TTCAGATTCA GGAACTTTGA CTAAAACTAC TTCATCAGCA AATCCTGCTG AAACTTCAGA AAATACTGAA CAGAATGCTG CGAAAAATTC ACTGGATATA TCATCTGATG ATGAAACATA CAGTAATGAA ACGGGAGTAG TTTTACAGCA AGAAAGTCCA CTTGGCGGAA TAAACCTTTA CCTTGCTATG GCGGCAGCCT TGCTAATATT GATTGCGATA GCTGCAGCAT GGTATCAGTC AAGAGAAAAA CCAGAAGTTT TGCCTCAGCC ATAA
Protein sequence | MKKICTLLLV FALVSGLNMA YADSAPSLPH TIYGDVSING LPATGTLKVL VNGVESEQVQ VTDGEFGKGL FDPKLVVSGV SGDKLTFSFE AESYTINPSY NIYLVDSAQY VSEIDFASGG YTKILLEFTG TADSGDTGDT GDSGDTGDTG DSGDTGDTGD SGDTGDTGDS GDTGDTGDSG DTGDTGDSGD TGDTGDSGDT GDTGDSGDTG DTGDSGDTGD TGDSGSMPLN PELFYGFATI GETSASGTLN VYVDDVLQDS IAVQNGLFGG SGPLAEKLVA TGYVGETNEV RFTLVSGGET YSSFSAEIGE ETYTDELPYV EGVKNIAIEF SGSTGISGDT GDTGDTGDSG SMPLNPELFY GFATIGETSA SGTLNVYVDD VLQDSIAVQN GLFGGSCTLA EKLVATGYVG ETNEVRFTLV SGGETYSSFS AEIGEETYTD ELPYVEGEAY YMEISFSEST GAVDNNSNSS NDETSDSSMP LYPELFYGLV YLDDTLASST LNVYVDDVLQ DSIEIENGIF GGEGPLDDKL TATGYEGNSN VVTFSLVSGG GTYSSFTAEL SDATYENEIP YDEGVHYVIL TFSSEATETG DSASTGGSGS GGSSSGGSSS SVIISSDSSE PSATTKNSDS GTLTKTTSSA NPAETSENTE QNAAKNSLDI SSDDETYSNE TGVVLQQESP LGGINLYLAM AAALLILIAI AAAWYQSREK PEVLPQP
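The relationship between the Gene sequence and the Protein sequence can be sketched in a few lines. This is a minimal illustration, not the method the database used: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGAAAAAAA TATGTACATT ACTATTAGTA"
print(translate(first_blocks))  # MKKICTLLLV
```

The output matches the first 10-residue block of the Protein sequence field. Passing the full Gene sequence string to `translate` should, under the same assumptions, reproduce the whole protein.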