Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2747 |
Symbol | |
ID | 3104188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2937676 |
End bp | 2939691 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637171881 |
Product | cellulose-binding domain-containing protein |
Protein accession | YP_115146 |
Protein GI | 53803113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAA GATTTCTTGC CCCCCTTTCC GGCCTGGTCC TGGTGGCCAC GCTCGCGGGA GGAACACCGG CGCAGGCCGC CACCCTGCCC GCGCCGGTAC CCAACGCGAT CGTCAGCGCG TCTTCCTCCT GGGGCACCGC GAGCGATTCC TGGGCCGGCT ACACCGGCGT CCTGCAAATC TGGGTGCCCG ACGCCGTCAG CGGCGGCTGG ACCCTGACGT TCCAGTCGGC CGGCCTCGGC CGGCAGGCAC AGGTCTCTTC CTTCTGGAAC GCCAATGCCG TTTTCGACCC TGTCACCAAC ACGTTCACCC TCACGTCGCC TTCCTGGGGC GGCGACGTGG CGGCAAACAG CGTGCTGGAC GTCGGCTTCA ACGCCAATGG CGCGTTCGAC ACCTCGGTCG ATCTGGCCAA CTGCAAATTC AACGGCCAGC CCTGTGTGAT CTCGGCCATG ACGTCCCAGT CGGCCCAGCA GACCCTGGCC AACCTGAAAG CCGGCTACCA GGGCGGTGGA TCGGCCACGC CGACACCCGC ACCTTCGGCG ACACCGTCCC CGTCTGCTAC ACCGGTTCCG GTCGCAAGCC CGAAACTGGA AGTGCTGTTT TCCATCAGCA GCTCGTGGGA TGGCGGCTAC AGCGGCAACG TCGCAGTCAA GAACCTGTCC TCGAAGACTT TGAAGGCCGG AGCCAACGGC TGGCAGGCCC CGCTGAAATT CCCCGACGCA GCCACTGCCC AGGACGTGTT CAAGAGCGGA CCTTGGAACT TTTCGGTCAA CATCGCCGGC GACGGCACCG CAACGCTCAA GCCGAAATCC TGGGCGGCGG CCCTGGCACC CGGCGATGTG GCGGCAAGCG GCTTCAACGG CGGTTCGCCG GCCAACCTGC AGAAGGCAGC CGCTGCGGAT TCCACGGTGA CGGTGCTGTT CGCACCCTCG GTGCCGAATT CGAACCCGAC GCCGACCCCG AATCCGACGG CCACCCCGAG CCCCACGGCT ACCCCGGCCC CGACTCCGGT CGCCTCGGCG ACGCCGAGTC CGAGCCCGAC GCCGGTACCG ACCACCCCGC CGACCGGAGG CGCCGGCAGC CTGCTGTTCA GCCCCTACAA GGACGTCACG ATCTCGATGA ACTGGAACAG CAACGTCATG TCGACCGCGG TGACTGGCAC GCCGTCGCCG CTCCTGAGCG TGCTGCCGGC CAAGGTACCC GCGGTGACCT GGGCCTTCGC GACCGGCGAG TGCGGCAAGG AGAACTGGGC CGGCATCCAG CCCGATGCCC TGGTCCAGGC CAACCTGCAA GCCTTCGTCG ACACCGGCAT CGACTACGTC GTCTCGACCG GCGGCGCGGC AGGCGCTTTC ACCTGTTCCA GCGAAACAGG CATGCGGGCC TTCATCGACC GCTATGCCTC GAGCCGTCTG GTGGGCATCG ATTTCGACAT CGAAGCCGGC CAGAGCCAGG CCACCATCGC CAGCCTGGTC CGGCAGGTCG CCGCAGTGCA GTCGGACTAC CCCAACCTGC GCTTCAGCTT CACCGTGGCC ACCTTGGGTT CGTCCAACGG GACCGTCACG TCAACCCCCT ACGGCGACCT GAGCGTCACG GGGTACAACG TGGTCAAGGC GGTTCAGCAA TATGGCGTCG CCAACTACAC CATCAACCTG ATGGTCATGG ACTACGGCAC GGCCAACGCC GGCAACTGCG TGGTCGTGAA CGGCAAGTGC GACATGGGTC AGACCGCCAT CCAGGCGGCC AAGAACCTGA AGGCCAGGTA CGGCATCCCC TACGAGCGGA TCGAGCTCAC ACCGATGATC GGCGTGAACG ATGTCACCGA TGAACTGTTC TCGCTGCAGG ACACCGGCAC CATGGTGCAG TGGGCGCTCG CCAACGGCAT CGCCGGCATC CACTTCTGGT CGGTCGACCG CGACACGCCG TGCAGCCAGA CCTCGGCCTC GCCGATCTGC AGCTCGGTTC CCAGCGTGCC GGCCTGGGGC TACACCAACC GCTTCATCGG CGACTTGGGC TTGTAA
|
Protein sequence | MNTRFLAPLS GLVLVATLAG GTPAQAATLP APVPNAIVSA SSSWGTASDS WAGYTGVLQI WVPDAVSGGW TLTFQSAGLG RQAQVSSFWN ANAVFDPVTN TFTLTSPSWG GDVAANSVLD VGFNANGAFD TSVDLANCKF NGQPCVISAM TSQSAQQTLA NLKAGYQGGG SATPTPAPSA TPSPSATPVP VASPKLEVLF SISSSWDGGY SGNVAVKNLS SKTLKAGANG WQAPLKFPDA ATAQDVFKSG PWNFSVNIAG DGTATLKPKS WAAALAPGDV AASGFNGGSP ANLQKAAAAD STVTVLFAPS VPNSNPTPTP NPTATPSPTA TPAPTPVASA TPSPSPTPVP TTPPTGGAGS LLFSPYKDVT ISMNWNSNVM STAVTGTPSP LLSVLPAKVP AVTWAFATGE CGKENWAGIQ PDALVQANLQ AFVDTGIDYV VSTGGAAGAF TCSSETGMRA FIDRYASSRL VGIDFDIEAG QSQATIASLV RQVAAVQSDY PNLRFSFTVA TLGSSNGTVT STPYGDLSVT GYNVVKAVQQ YGVANYTINL MVMDYGTANA GNCVVVNGKC DMGQTAIQAA KNLKARYGIP YERIELTPMI GVNDVTDELF SLQDTGTMVQ WALANGIAGI HFWSVDRDTP CSQTSASPIC SSVPSVPAWG YTNRFIGDLG L
|
| |