Gene MCA2747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2747 
Symbol 
ID3104188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2937676 
End bp2939691 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content66% 
IMG OID637171881 
Productcellulose-binding domain-containing protein 
Protein accessionYP_115146 
Protein GI53803113 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAA GATTTCTTGC CCCCCTTTCC GGCCTGGTCC TGGTGGCCAC GCTCGCGGGA 
GGAACACCGG CGCAGGCCGC CACCCTGCCC GCGCCGGTAC CCAACGCGAT CGTCAGCGCG
TCTTCCTCCT GGGGCACCGC GAGCGATTCC TGGGCCGGCT ACACCGGCGT CCTGCAAATC
TGGGTGCCCG ACGCCGTCAG CGGCGGCTGG ACCCTGACGT TCCAGTCGGC CGGCCTCGGC
CGGCAGGCAC AGGTCTCTTC CTTCTGGAAC GCCAATGCCG TTTTCGACCC TGTCACCAAC
ACGTTCACCC TCACGTCGCC TTCCTGGGGC GGCGACGTGG CGGCAAACAG CGTGCTGGAC
GTCGGCTTCA ACGCCAATGG CGCGTTCGAC ACCTCGGTCG ATCTGGCCAA CTGCAAATTC
AACGGCCAGC CCTGTGTGAT CTCGGCCATG ACGTCCCAGT CGGCCCAGCA GACCCTGGCC
AACCTGAAAG CCGGCTACCA GGGCGGTGGA TCGGCCACGC CGACACCCGC ACCTTCGGCG
ACACCGTCCC CGTCTGCTAC ACCGGTTCCG GTCGCAAGCC CGAAACTGGA AGTGCTGTTT
TCCATCAGCA GCTCGTGGGA TGGCGGCTAC AGCGGCAACG TCGCAGTCAA GAACCTGTCC
TCGAAGACTT TGAAGGCCGG AGCCAACGGC TGGCAGGCCC CGCTGAAATT CCCCGACGCA
GCCACTGCCC AGGACGTGTT CAAGAGCGGA CCTTGGAACT TTTCGGTCAA CATCGCCGGC
GACGGCACCG CAACGCTCAA GCCGAAATCC TGGGCGGCGG CCCTGGCACC CGGCGATGTG
GCGGCAAGCG GCTTCAACGG CGGTTCGCCG GCCAACCTGC AGAAGGCAGC CGCTGCGGAT
TCCACGGTGA CGGTGCTGTT CGCACCCTCG GTGCCGAATT CGAACCCGAC GCCGACCCCG
AATCCGACGG CCACCCCGAG CCCCACGGCT ACCCCGGCCC CGACTCCGGT CGCCTCGGCG
ACGCCGAGTC CGAGCCCGAC GCCGGTACCG ACCACCCCGC CGACCGGAGG CGCCGGCAGC
CTGCTGTTCA GCCCCTACAA GGACGTCACG ATCTCGATGA ACTGGAACAG CAACGTCATG
TCGACCGCGG TGACTGGCAC GCCGTCGCCG CTCCTGAGCG TGCTGCCGGC CAAGGTACCC
GCGGTGACCT GGGCCTTCGC GACCGGCGAG TGCGGCAAGG AGAACTGGGC CGGCATCCAG
CCCGATGCCC TGGTCCAGGC CAACCTGCAA GCCTTCGTCG ACACCGGCAT CGACTACGTC
GTCTCGACCG GCGGCGCGGC AGGCGCTTTC ACCTGTTCCA GCGAAACAGG CATGCGGGCC
TTCATCGACC GCTATGCCTC GAGCCGTCTG GTGGGCATCG ATTTCGACAT CGAAGCCGGC
CAGAGCCAGG CCACCATCGC CAGCCTGGTC CGGCAGGTCG CCGCAGTGCA GTCGGACTAC
CCCAACCTGC GCTTCAGCTT CACCGTGGCC ACCTTGGGTT CGTCCAACGG GACCGTCACG
TCAACCCCCT ACGGCGACCT GAGCGTCACG GGGTACAACG TGGTCAAGGC GGTTCAGCAA
TATGGCGTCG CCAACTACAC CATCAACCTG ATGGTCATGG ACTACGGCAC GGCCAACGCC
GGCAACTGCG TGGTCGTGAA CGGCAAGTGC GACATGGGTC AGACCGCCAT CCAGGCGGCC
AAGAACCTGA AGGCCAGGTA CGGCATCCCC TACGAGCGGA TCGAGCTCAC ACCGATGATC
GGCGTGAACG ATGTCACCGA TGAACTGTTC TCGCTGCAGG ACACCGGCAC CATGGTGCAG
TGGGCGCTCG CCAACGGCAT CGCCGGCATC CACTTCTGGT CGGTCGACCG CGACACGCCG
TGCAGCCAGA CCTCGGCCTC GCCGATCTGC AGCTCGGTTC CCAGCGTGCC GGCCTGGGGC
TACACCAACC GCTTCATCGG CGACTTGGGC TTGTAA
 
Protein sequence
MNTRFLAPLS GLVLVATLAG GTPAQAATLP APVPNAIVSA SSSWGTASDS WAGYTGVLQI 
WVPDAVSGGW TLTFQSAGLG RQAQVSSFWN ANAVFDPVTN TFTLTSPSWG GDVAANSVLD
VGFNANGAFD TSVDLANCKF NGQPCVISAM TSQSAQQTLA NLKAGYQGGG SATPTPAPSA
TPSPSATPVP VASPKLEVLF SISSSWDGGY SGNVAVKNLS SKTLKAGANG WQAPLKFPDA
ATAQDVFKSG PWNFSVNIAG DGTATLKPKS WAAALAPGDV AASGFNGGSP ANLQKAAAAD
STVTVLFAPS VPNSNPTPTP NPTATPSPTA TPAPTPVASA TPSPSPTPVP TTPPTGGAGS
LLFSPYKDVT ISMNWNSNVM STAVTGTPSP LLSVLPAKVP AVTWAFATGE CGKENWAGIQ
PDALVQANLQ AFVDTGIDYV VSTGGAAGAF TCSSETGMRA FIDRYASSRL VGIDFDIEAG
QSQATIASLV RQVAAVQSDY PNLRFSFTVA TLGSSNGTVT STPYGDLSVT GYNVVKAVQQ
YGVANYTINL MVMDYGTANA GNCVVVNGKC DMGQTAIQAA KNLKARYGIP YERIELTPMI
GVNDVTDELF SLQDTGTMVQ WALANGIAGI HFWSVDRDTP CSQTSASPIC SSVPSVPAWG
YTNRFIGDLG L