Gene MCA1410 details

Gene Information

Locus tag: MCA1410
Symbol:
ID: 3102884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1497016
End bp: 1498221
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 63%
IMG OID: 637170586
Product: tetratricopeptide repeat protein
Protein accession: YP_113868
Protein GI: 53804259
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
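
The start, end, gene length and protein length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1497016, 1498221)
print(length)                           # 1206
print(expected_protein_length(length))  # 401
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.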


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0344069
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGAAC TGCTGACCCT GTTGCTTCCG GTGGCTGCCG CTTCCGGTTG GTATGCAGCG 
GCCAGACACC ACGGACGAAA CCTGTCGGCC GGACTGAATG ACGGGCTCCA GCGGGCCTAT
CAGACCCCCA CCGATACGGC CATCGGAGCC AGAGCCGATG AGGCGTTTCA TCTGCTCAGA
CAGATCGCTG ATTCCAGCGC CAAGACCCTG GAGTTGCAAC TGGCCCTCGG CGGCCTGCTG
CGCAGGAGTG GAGAGCTTTC CAAGGCGATC GAACTGCACG AACGCCTCCA TTCCCAACCC
CAGATGTCGG ACGAACAGCT GCACGCCATT CGTTTCGAAC TGGGCATGGA TTATCTGAGC
GCCGGGCTGC TGGATCGGGC CGAAACCGTC TTCGCGGGCC TGACCGCGAC TGCTTCCCAC
GGCAAGGCAT CGCTCCAGCA AATGCTGGCG ATCTACCAGA GCGAAAAGGA TTGGGTGAGG
GCGGCCGAAT GCGCCCGGTC ACTCAAGAAG TTCGACGCCG GCCAGCGCAA CGCCACGTTA
GCCCATCTCC TGTGCGAGCA GGCCGAATCG GCGATCGCCC GGGGAGAAAC AGTCGCGGCC
CATGCGCATC TTCAGCAGGC CCTGGCCGAA GATCCGCGCT GCGTCCGGGC GACCCTCCTG
AAATCAAGGC TCGCTCTGCA ACGACAGCAA TGGGCGGAAG CGGGTACCCT GCTACGGTCG
GTGGAATTCC AAAATCCCGC CTTTCTGCCG GAGGTCATAG GGCAGTTGCG GCTCTGCCAT
GCCAGTCTTC GGGACATGGA CGGGTATTTG ACCTATCTCG ACTATGTCTA TCAGCGTTAT
CGTACAGAAG CGGCCGCCCT ACTTCTGGCC GACGAACTGG CGCAACGGGA AGGGGCGCCC
GCGGCGGCGA GCTATCTGAC GGGTCTTCTG TCCGAGCGGT CGAGCCTGAA CCTGATCCGC
CGCACGCTGC ATTATGCCGG AGCAGCCACC TGCATCGACG GTGATTGCCT GTCGATCCTG
CGGCGCTGCC TGACCGCCCT CGACGGCCTC GCGTCCATGC ATCCCGGTTA CGGTTGCGTA
CAGTGCGGCT ATGAATGCCG TGAGCTGCAT TGGCACTGTC CCACCTGCCG CGCCTGGGAA
ACCATCCAGC CGGCCGATCC GGGTGTCGGT TTCAGCGGCA AGACCCCAGC GACAACCCCA
TCTTGA
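
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before its length or GC content can be compared with the fields in the record. The following is a minimal sketch; the helper names are hypothetical, and the rounding the database uses for its GC percentage is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGCTTGAAC TGCTGACCCT GTTGCTTCCG GTGGCTGCCG CTTCCGGTTG GTATGCAGCG"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_content(seq):.1%}")
```

Passing the complete gene sequence to the same two functions gives values that can be compared with the Gene Length and GC content fields.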
 
Protein sequence
MLELLTLLLP VAAASGWYAA ARHHGRNLSA GLNDGLQRAY QTPTDTAIGA RADEAFHLLR 
QIADSSAKTL ELQLALGGLL RRSGELSKAI ELHERLHSQP QMSDEQLHAI RFELGMDYLS
AGLLDRAETV FAGLTATASH GKASLQQMLA IYQSEKDWVR AAECARSLKK FDAGQRNATL
AHLLCEQAES AIARGETVAA HAHLQQALAE DPRCVRATLL KSRLALQRQQ WAEAGTLLRS
VEFQNPAFLP EVIGQLRLCH ASLRDMDGYL TYLDYVYQRY RTEAAALLLA DELAQREGAP
AAASYLTGLL SERSSLNLIR RTLHYAGAAT CIDGDCLSIL RRCLTALDGL ASMHPGYGCV
QCGYECRELH WHCPTCRAWE TIQPADPGVG FSGKTPATTP S
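
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library. Translation table 11 assigns the same amino acids to codons as the standard code, and the tables differ in which codons may act as start codons. Since this gene begins with ATG, alternative start codons are not handled here. The function name is hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGCTTGAAC TGCTGACCCT GTTGCTTCCG"))  # MLELLTLLLP
print(translate("ACCCCATCTTGA"))                      # TPS
```

Both fragments match the corresponding ends of the listed protein sequence.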