Gene Information |
Locus tag | MCA2914 |
Symbol | |
ID | 3104506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 3103485 |
End bp | 3106466 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637172043 |
Product | prophage MuMc02, TP901 family tail tape measure protein |
Protein accession | YP_115307 |
Protein GI | 53802953 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
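The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the record or of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon (both are
    assumptions; the record itself does not state them).
    """
    gene_length_bp = end_bp - start_bp + 1      # inclusive range
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the Start bp and End bp fields of this record.
print(cds_lengths(3103485, 3106466))  # (2982, 993)
```

The result matches the reported Gene Length (2982 bp) and Protein Length (993 aa).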
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAG AGCTTGCCTT CGGGATCGTG ATCGGGGCGG CGCTGTCAAG CGCCTTTACC GCCGCCTTCG GCAATGCCAA GAAGACCATC GACACGCTGG GCGCCAGCGT GCGCGACCTG ACTGACAAGC AGAAGACGCT GGGCGCCACG ATTCAGAAGT CCGGCAGGCT GGCGCAAGAA TCCCTGGCGC CGCTTCATCG CGACTACGAA CGGCTGGGGC GTCTGATCGA CGAGATACGA CGGCGACAGG AGGCGCTGAC CGCCAGTCTG GCGCGCGGAG CCGCGCTCAA ACAGGAGCGC GCCGATTTGC GGGGGCAGGC GTTGGAAACC GTCGGAACCG GCGCGGCGCT GGGCGCGCCG GTCGTGGCGT CCGTCCGGCT GGCGGGCAAT TTCCAGGACC AGCTCCGCGA TATCGCCATC ACCGGCGAGT TCACGACCGC GCAGGAAAAC CGCTTGGGCG CCGCGGTGCG TGAGAGCGCC CTGAAGTGGA ACCAGACCCA GGCCGAGATC GCTTCCGGCA TCGGGGTGCT GGTGGCCGGA GGTGTCCAGG ATGCCCAGGC GCTGGACCGG TATACGCCGG TCCTTGCGAA AGCGGCCACG GCGACGCGGG CCAGCATGGC CGATCTTGGC AGCGTGCTGC TGGCATTCGA CAACAATCTC AAGGTTTCCG CGGACCAATC CGAGTCGGCG CTCAACATGC TGGCCTACGC TGGCAAGCGC GGGCAGTTCG AGATCCGCGA TTCCGCCAAA TGGCTGCCGG CGCTGGCGCC GATGTTCCAG ACCCTGGGCG TGACGGGCAA GGAAGCCGTG GCCGAGATCG GTGCGGCACT ACAGATCGCC CGGAAGGGTG CGGGCACCAA CGACGAAGCC GCCAACAACT TCCGCAATTT TCTGGCGAAA CTGACCTCGC CGGACACGTT GAAGGACTTC GACAAGGCCG GTATCGATCT GCAGAGCAGC TTGAAGAATG CCGCCAAGCG CGGCATCAGT CCCATGGAGG CGATGATGGA TACCATCACC GCCTATATCG GCAGCAAAGG CCCGCAGGCG GCCGCGGCAT TCAAGCAGGC TCTGTCCCTG GAAGACGCGC AGAAACGTGC CGAGGCCCTC CAGGCGTTGT CCGGTTCGTT CAGGCTGGGT GAATTGTTCC AGGACATGCA GGCGATGTCC TTCATCCGCC CGATGCTGGC GAACCGGGCC GAGTACAAGG ACATCAAGCA AGGCGCGCTC GGCGCGGCGA ATCAGGACTT GATAGGCGCC GACTTCCAAA AGCGCACCCA GGGTTTCAAT GAAAGCCTGA AAGCCTTCCG TATTGGGATG AGTGAGGTTG GCCTGGTGGT CGGCGAAGCC CTGCTGCCGC CGCTCACGGA TCTGTTGCAG ACCGTCCGTC CCGTCATCCG GGAATTCGGG CAGTTCGCCT CGGCACATCC GGGCGTGATT CGGGGTGTGG TCGGGTTAAC CGCCGGCTTG CTGGGTGGGA AGCTTGCCGT TTTGGCGGTG AGATATGCCG TCAACCTGCT GGTGTCGCCC TTCAATGCGT TGGCTACGGC GACTCAATTG GTATTGGGTA AATGGACACT GCTCAAGACC GCTTTGCAGT TCGGGCCGCT GGCTCGAGCC GGCGGGGCTT TGAGCGCCGC AGGTTCGGCG GCGCTGGGCG CCGGTCGCGC CCTGGGCGGG ATTTTGCTTT CCGGCTTTCG ACTCGCAGGC GCTGCAACGC TGGGGTTTGG ACGCAGCATT GCCAGCCCCT TGCTTTCTGG CCTTCGTTTG GCCGGCGCGG CCGCGGCTGG ATTTGGTCGT 
ATCCTGATCG GTTCCTTCCT TGCCGGCCTT CGACTGGCGG GCGTGGCGGC GCTGGAACTC GGACGTTTCT CCGCCGGGGC CTTGCTTTCT GGCCTTCGTT TGGCAAGTAC GGCTGCGGCG GGGCTCGGTC GCATTCTGAT CGGCTCAGCT CTCGCCGGCC TTCGGTTGGC CTGGCTGGCC GCGCTAGAGC TCGGACGGAT TCTTGGTGGC GCATTGATTT CCGGTCTCAA GCTGGCGGCT CAGGCCGTCA TGTTCCTCGG CCGCGCACTG CTGTTCACGC CCATTGGCGC AGCCGTTGCC ATCATCGCCG GAGCGGCGTT CCTGATCTGG AAAAACTGGG ATGCATTGAA AGCCAGGTTC GCGCCTTTGT GGGATCAGGT CAAGGGCATC TTTGGCCGGG CGCTGAGCTG GTTCAAGACG CTGCCAGACG TTTTCAAAGC CATCGGCTCG AATCTGCTGG AAGGACTGAG GTCCGGCATC ATGGCCAAAT GGGAGTCGGT CAAGGCAGGG TTGTCGAGCA TTGCTTCCGG CATCAAGGAC ACCTTCAAGA GCGCGCTGGG GATTCATTCG CCTTCGCGCG TATTCGCCGG CTACGGCGCC GACATCGGAC AGGGATTGAT TGAGGGCGTG ACCGGGCAGA AAGACGCCGT GGCGGAGACC CTTGGCAAGC TGGTGCGGTT TCCCCAGCCG CCTGTCATTC GGGTCCACAC CGAACTCGGT TCTTCGACAG GCCCGGAGCG CAGCGGACTG GCGCCGGTAT CCCTCGATGC CGGCCTGGGC CGGGAGCTGG GGCGTGTATT GCCGTTCGCC AAACCCGACT CGAAGGGCAA CGGCGCCGAC TCGAAGGGCA ATGGAGTTGA ATCGCCCGCC GCGACCACCG ACACCCCTGC GCGTCTCTTG GGGCGTAAAT TCGGCGACGC CGCTCTTCGA GTCCGTCAGG GCAGCGATGT CGCCAGTGGC GCGGACGGAC ACGCTCGCAC CTTCCGCTCC GTGCACCAGC CCCAGGGGCG CAGCGGTGAA CAGGTGGTGA TCCACTTCAA CCCGACGATC CACGTCAGCG GCGCGGAGGA TGCCGGCAAG GCCAAGGCAG CCGTCACCGA AGCGCTCGAG TTGTCGCTGC GCGACTTCGA ACGCGTGGCT GCGCAGGCCA GCCATGCCAG GGCGAGGACG GGCTACCGGT GA
|
Protein sequence | MAKELAFGIV IGAALSSAFT AAFGNAKKTI DTLGASVRDL TDKQKTLGAT IQKSGRLAQE SLAPLHRDYE RLGRLIDEIR RRQEALTASL ARGAALKQER ADLRGQALET VGTGAALGAP VVASVRLAGN FQDQLRDIAI TGEFTTAQEN RLGAAVRESA LKWNQTQAEI ASGIGVLVAG GVQDAQALDR YTPVLAKAAT ATRASMADLG SVLLAFDNNL KVSADQSESA LNMLAYAGKR GQFEIRDSAK WLPALAPMFQ TLGVTGKEAV AEIGAALQIA RKGAGTNDEA ANNFRNFLAK LTSPDTLKDF DKAGIDLQSS LKNAAKRGIS PMEAMMDTIT AYIGSKGPQA AAAFKQALSL EDAQKRAEAL QALSGSFRLG ELFQDMQAMS FIRPMLANRA EYKDIKQGAL GAANQDLIGA DFQKRTQGFN ESLKAFRIGM SEVGLVVGEA LLPPLTDLLQ TVRPVIREFG QFASAHPGVI RGVVGLTAGL LGGKLAVLAV RYAVNLLVSP FNALATATQL VLGKWTLLKT ALQFGPLARA GGALSAAGSA ALGAGRALGG ILLSGFRLAG AATLGFGRSI ASPLLSGLRL AGAAAAGFGR ILIGSFLAGL RLAGVAALEL GRFSAGALLS GLRLASTAAA GLGRILIGSA LAGLRLAWLA ALELGRILGG ALISGLKLAA QAVMFLGRAL LFTPIGAAVA IIAGAAFLIW KNWDALKARF APLWDQVKGI FGRALSWFKT LPDVFKAIGS NLLEGLRSGI MAKWESVKAG LSSIASGIKD TFKSALGIHS PSRVFAGYGA DIGQGLIEGV TGQKDAVAET LGKLVRFPQP PVIRVHTELG SSTGPERSGL APVSLDAGLG RELGRVLPFA KPDSKGNGAD SKGNGVESPA ATTDTPARLL GRKFGDAALR VRQGSDVASG ADGHARTFRS VHQPQGRSGE QVVIHFNPTI HVSGAEDAGK AKAAVTEALE LSLRDFERVA AQASHARART GYR
|
| |
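The relationship between the two sequence fields can be illustrated by translating the gene sequence and computing its GC content. This is a minimal sketch, not the method used to produce the record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. Table 11 differs in its set of permitted start codons, which does not matter here because the gene sequence as shown begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field above.
fragment = "ATGGCTAAAG AGCTTGCCTT CGGGATCGTG"
print(translate(fragment))  # MAKELAFGIV
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.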