Gene MCA1330 details

Gene Information

Locus tag: MCA1330
Symbol:
ID: 3102120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1417183
End bp: 1418604
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 62%
IMG OID: 637170508
Product: hypothetical protein
Protein accession: YP_113792
Protein GI: 53804574
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.643696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCGCCG CAACCCTTCC CGCGGCCGGC GAACCCCTCA ACCTCGAGCT GCCGGACATG 
GGCGACTCCA CCGGCACTCT TTTCACGCCC CAGCAGGAAA AAGCGCTGGG CGAGGCCTTT
TACCGGAATC TGCACCTCCA GGTCCAGATC AACGAAGATC CGGAAGTCAC CGACTACATT
CAAGCGCTGG GGCGGAAGCT GGTGGAAAAC AGCGACACGC CAGCCCAGCC TTTTTACTTC
TTCGTCGTCA ATCAGCCGGT CATCAACGCG TTCGCCGGTC CCGGCGGCTA CATCGGCGTC
AACTCGGGGT TGATCCTCAT CACCGAAAGC GAAAGCGAGC TGGCCTCGGT GCTCGGACAC
GAAATCGCAC ACATCACCCA GCGCCATCTC TACGAGGCAT TCCAGGCCGC CGGCCGGCTA
TCGCTGCCGA CCGCGGCGGC CATGCTGGCC GGCGTGCTGC TGGGCGCAGG CACTGGCTCC
AGTCAGTTGG GCCAGGCTGC AGTCATCGCC GCCACGGCAG CCAGCCAACA GATGCAGATC
AATTTTACCC GGGACAACGA GGCGGAAGCC GATCGGGTGG GCATGAAAAT CCTCTCTGGC
TCGAACTTCG ATCCCCGTGC GATGCCCACC TTCTTCGAAC GAATGCAGCA ATCCACCCGC
TTCTCCACCG GCCGCAGCAC GCCGGAATTT CTCCTGACCC ACCCGGTCAC CGTGTCGCGT
ATCGCCGACA CCCGCGGGCG GGCCGAACAA TATCCCTACA AGCAATATCC CGACTCGTTC
ACCTACCAGA TCATCCGGGC CAAGCTGCAC GTTCAGACGA CCCACAATCC TCAGGAAAGC
GTCGATTATT TCACCGCCAT TTCGGAGGTG GGCACCCGTC AGCAGCAAGA CGTGGCCCAT
TACGGACTGG CCCTTGCCCT GGTCGCCCAG GGCAAGATTG GTCAAGGCAG ACCCATGCTG
GAGGAACTCA TCCGCCGCTA TCCCGAGCAG TCGCACTTCT TCAATGCCCT CGCTGACGCG
GAACGCGAAG CCAAGACCTA CCCCGCCGCC TTCGCTATCT ACGAGGAAGC CTTGAAGCGC
TTTCCCGGCA ACCGCGCGCT CACTTTGAAC TATGCCCAGA CCCTGGTCCG CGCCGGCAAA
CCCCTGGAGG CGCGCAAGCG GCTGCAGGAC TACCTGCTCC ATTTTCCCGC TACGCCGGAG
GTATATGAAC TGCTGGCGCA AGCCCACTCC CAGCTCGGCA ACGAAGCGGA ATCCCACCGA
TACCTGGCCG AAGCCTATTA CGCCGACGGT CAGACCCGCA ACGCCATCCT GCACCTCAAG
CTGGCACAGA AAGCACCAGG CCGCGATTTC CAGACCGACG CGGCGATCGA GGAGCGACTG
AAGGAACTAA TGGAAGAGCA GAGGGAGGAA AGGGAAAAAT GA
 
Protein sequence
MIAATLPAAG EPLNLELPDM GDSTGTLFTP QQEKALGEAF YRNLHLQVQI NEDPEVTDYI 
QALGRKLVEN SDTPAQPFYF FVVNQPVINA FAGPGGYIGV NSGLILITES ESELASVLGH
EIAHITQRHL YEAFQAAGRL SLPTAAAMLA GVLLGAGTGS SQLGQAAVIA ATAASQQMQI
NFTRDNEAEA DRVGMKILSG SNFDPRAMPT FFERMQQSTR FSTGRSTPEF LLTHPVTVSR
IADTRGRAEQ YPYKQYPDSF TYQIIRAKLH VQTTHNPQES VDYFTAISEV GTRQQQDVAH
YGLALALVAQ GKIGQGRPML EELIRRYPEQ SHFFNALADA EREAKTYPAA FAIYEEALKR
FPGNRALTLN YAQTLVRAGK PLEARKRLQD YLLHFPATPE VYELLAQAHS QLGNEAESHR
YLAEAYYADG QTRNAILHLK LAQKAPGRDF QTDAAIEERL KELMEEQREE REK
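
The length fields in the Gene Information section can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon (TGA in the gene sequence) is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1417183
end_bp = 1418604

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the terminal stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1422 473
```

Both values agree with the listed Gene Length (1422 bp) and Protein Length (473 aa).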