Gene MCA3110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA3110 
Symbol 
ID3103352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3298079 
End bp3301144 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content59% 
IMG OID637172236 
Producthypothetical protein 
Protein accessionYP_115497 
Protein GI53802722 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGGAGG TCAGATTCAA TCCCTTTTCA GGCAGGTTGG AGTTCGATCA GATGGCATTG 
GCCACCCTTG ATGGCCAATC CGCCCTGACG GCGGAGCACG TCTTTTTCGA TATCGCCGTG
GCGGCCAGCC TGAGAAGCGG GTATTTGGTG GTGGAGGGCA TGGCGGACCA GGTCGGTCTC
CGTATATGTC TGGATGCCAA GGGGGTTTCC AACTGGACCA AGCTCATGGG AGGGACCGGA
GAGTCGCCTC CTCCACGGGT GCCTTTTGGT GTCGAACGAT TTTCCATCAG AAACGCTGGC
GTGGAATTCA TCGATGAAGG TAGCAAGGTC CACCTCGGCG CCTCGGAGGT GAATGCCGAT
CTTTACGGTC TGGGTCCGGA TTTGAGCGAG CCGGCGCGCC TGGAAATGAA GGCTTCCATT
GCAGGCAGGG CCAAGGTCAG CGGCAATTCG GAACTCACGC TGATTCCTTT TCATGTGAAT
CCAGTCGTCT ACCTGGATGA TTTCGACCTG ACATCCATCG CGGCTTATCT CGAAGGGATG
GGCGTCGTGA TCAAGCAAGG AAGGATGAAA GGAAATCTGG CGTTTGAATA TGGATCGGAT
GAGGCAGGAC AAGTGTTCAG GCTGGGTAAA TCGAATCTTA TATGGCATGA CGTTCAATGG
GGCATGAAAA ACGGTCCGGA CGCAGATTGG GAGGTCGACG ACATCCGCCT CGACGGGCTG
GAATACGATG GTGCTACGGC GACGCTGAGG TTGGCCAAGT CACATATAGA TACTCTGGCA
GTTGCTTCCC GGAATGGGGT CAAGGTCCGG ATTCGTGACG CGGATTTGGG AAAGCTGTTG
CTCGATTTGT ACAGGTTTTC GGCGCAGGCC GGGAGCATCG CCATACGTAC CGTCGACCTG
ACCCAGGACC GCGGGTCTGG AAGCATGGCG GCGGCAGTTT TCAATCTCTG GTCCAATGGT
TTGGGATGGA GTGAAACCCG GCTGGCGGCG AGAGGATTAG GATGGGAGGG CATGAGGCTC
TGGGATTCTT CCGCGACCTT GGGTGAACCA CCGGAGATTC GTTTCAGAAA AATGTCAGCG
GACGATGTGG CGGCCGATTT CGTAAAAAGG TCTTTTTCGA TGGCCAAGTT CGATTCCGCC
GATGCTGAAA TCAGTGCCTG GATTTCGCCA AAAAGGGAAT TCGAAATTCC TGGTTTTCTC
GGTGACTGGG CCGGCGTGAT ACGTCCTGGC TCTTCTAACG AAGGCTGGGC GTTCAGTCTG
GGGGAAGGAG TGATCCGAAA TTACCGGCTC AACCTCGCCG ACCATGGCGT CGATCCGCCA
GCCCGTATCG GCCTCGATGA TTTGGAAATA CGGCTCCAGG GAGTGGATAC TCGGCAGGGA
AAATTCGCGC TCCGTCTGGA ATCTGTCGTG GACCGTAAGG GCAGGATCAG CGTGCAGGGT
TCCGGGAGTT TCGATCCACC GGAAGCCGAA CTGCGCCTGC AAGTGGAAAA CCTCGGTCTG
CGGCCGTTCC GATCCTATCT CGACGATTTC GCCCGGATTG ATCTGGCCAA GGGACGGCTC
AACCTCGAAG GGGGGCTGGC ATACCGTCCG GTCGGGAACG ACGTTCGTTT CAGCGGAACG
GCCGAAATCG CCGGCCTGGT TACCGTGGAC AGGAAGGATG GAAGGGATTT CATTCACTGG
CGGTCCTTGC GCGCGGAAGG ACTGACACTG GAAACGTCCG CGAACCGGCT GAGCATACGC
CAACTGGTCG CCGACAGGCC CTATGCCCGT ATCGTCGTGA GCCGGCAACG TACGCTCAAT
TTGATCGAAA ATCTGTTTCA GCCGCGTTCG AAGCCCGCTG GTCCGTCTGG GCAGGCGAGC
CGGCCGTTCG CGGTCACGGT GGGTTCCCTC CTGGTTCGCG ATGGCTCCGC GGATTTTTCC
GACCTCAGTC TCCAGCCGAG CGTTTCCGTC GATATTCGCG GTCTGACCGG TGTCGTCCAG
TCTTTGTCGT CCAGGCCGGA CGCCGAGGCG GAAGTCTCGA TCAAAGGCAG CATCAGCGAT
ACCTCACCGG TGACCATCAG CGGCCGGATC AACCCGTTCC TGTTCGGTAC CTTCGCCGAC
CTCAGCATAC GCTTCAAGAA CGTCGATCTC ACCGAACTCT CGCCTTATTC CGCCCGCTTC
GCCGGTTATC GCATCGATAA GGGCAAGGCC GACCTGGATC TGCATTACCG GCTTCGCGAC
CGGAAACTGC TGGCCGACAA CAACCTGGTG TTCGACCATC TGACCCTGGG TGAGCGCGTC
GACAGCCCGG AGGCGATTTC CCTGCCGGTG AAGCTCGCGG TGTCCCTGAT GCGGGGGCTG
GACGGCAAGA TCAACATCGA TTTGCCCATC AGCGGCAATT TGGACGATCC TAAGTTCAGC
ATCACCGGCT TGTTGACGAA AGCTGCCGTG GGGGTGATCA CCAAGGTGGT CAGCTCGCCG
TTTTCAGCGA TCGGCATGTT GTTCGACGGC GGCAGTGACG ATGCCGGCTC GATCGATTTC
CGTCCTGGTT CGTTCGAACT GGAGGGTGCC GAAAAGAGTA GGCTGGACGG ACTGGCGACC
GCGCTTTCGC AACGCCCCGG CCTGTCCCTG GAAATCCGCG GAACAGCCCG GAGCGGCAGG
GATGCGAGCG CGCTCGCAGA ACAGCAGTTG CGGCGGCAGC TGGAGAACGC CAAAGCCATC
GAACTGCGTC TTGCTGGAGG AGACCGAGAC AGGGCGCCGG CCGGTTCCTC GGTCCTGTCC
GGGGAGGACT ATCGCCGCTT GTTCAGTCAC TTCTACCGTC TCCGTTACCC CGGAGCAGCG
GAATGGGCGG CGTTGCCGCG AGGCGAGCGG GTGCTTGGGG GGGAGCTTTT TGAAAGCGCC
CGGGGGAAAG TGTTGAAGGA CTGGTCCATC AGCGAAATCG ATCTGCGCCG TCTGGCGCAG
GCGCGCGCCG CGGCAGTACG CAGCTATCTC GTGCAGAAAG GGATCGAACC GACGCGGATT
TATCTGCTCG ACGTGGAGCT GACCCCCGGT GACGGTGACA CCATCGCCCT CTTGAGCCTG
AGTTGA
 
Protein sequence
MTEVRFNPFS GRLEFDQMAL ATLDGQSALT AEHVFFDIAV AASLRSGYLV VEGMADQVGL 
RICLDAKGVS NWTKLMGGTG ESPPPRVPFG VERFSIRNAG VEFIDEGSKV HLGASEVNAD
LYGLGPDLSE PARLEMKASI AGRAKVSGNS ELTLIPFHVN PVVYLDDFDL TSIAAYLEGM
GVVIKQGRMK GNLAFEYGSD EAGQVFRLGK SNLIWHDVQW GMKNGPDADW EVDDIRLDGL
EYDGATATLR LAKSHIDTLA VASRNGVKVR IRDADLGKLL LDLYRFSAQA GSIAIRTVDL
TQDRGSGSMA AAVFNLWSNG LGWSETRLAA RGLGWEGMRL WDSSATLGEP PEIRFRKMSA
DDVAADFVKR SFSMAKFDSA DAEISAWISP KREFEIPGFL GDWAGVIRPG SSNEGWAFSL
GEGVIRNYRL NLADHGVDPP ARIGLDDLEI RLQGVDTRQG KFALRLESVV DRKGRISVQG
SGSFDPPEAE LRLQVENLGL RPFRSYLDDF ARIDLAKGRL NLEGGLAYRP VGNDVRFSGT
AEIAGLVTVD RKDGRDFIHW RSLRAEGLTL ETSANRLSIR QLVADRPYAR IVVSRQRTLN
LIENLFQPRS KPAGPSGQAS RPFAVTVGSL LVRDGSADFS DLSLQPSVSV DIRGLTGVVQ
SLSSRPDAEA EVSIKGSISD TSPVTISGRI NPFLFGTFAD LSIRFKNVDL TELSPYSARF
AGYRIDKGKA DLDLHYRLRD RKLLADNNLV FDHLTLGERV DSPEAISLPV KLAVSLMRGL
DGKINIDLPI SGNLDDPKFS ITGLLTKAAV GVITKVVSSP FSAIGMLFDG GSDDAGSIDF
RPGSFELEGA EKSRLDGLAT ALSQRPGLSL EIRGTARSGR DASALAEQQL RRQLENAKAI
ELRLAGGDRD RAPAGSSVLS GEDYRRLFSH FYRLRYPGAA EWAALPRGER VLGGELFESA
RGKVLKDWSI SEIDLRRLAQ ARAAAVRSYL VQKGIEPTRI YLLDVELTPG DGDTIALLSL
S