Gene MCA1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1003 
Symbol 
ID3103760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1049465 
End bp1051129 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content67% 
IMG OID637170189 
Producthypothetical protein 
Protein accessionYP_113480 
Protein GI53804877 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGGCT TCCCACGGGG CCGGTTGGTT GCACTCGTGG CGGCATTTCT GATCAGCGCC 
GATGCCGCGG GCACCACACC GGATGCGGCG GCGCACATCG ATGGACTGCT CGCTTCGGGT
GTCCACCCCC GCTTGCGCTG GGAGCGTTTC ACCGATTTCC AGGAGCCCTT GCGTGCCCTG
TATCTTGCCC AGGGTTCACG GCCTTTGTGG CTGGACGGGG GCAGGCCGGT GAAGCAGGCT
TCCGCCGCTC TCGAGTGTCT GCGCATGGCC GACGACCAGG GCTTGAACAG CAGCGACTAT
GACGCCGATC TCCTGGGCGG CTGGATCGAG AAGCTCAACG ACGACAGGGC GGCGAGTGCG
GAAGAGGTGG CGCAGTTTGA GGTGGCGATG AGTCTCGCTC TGATGCGCTA CGGCTCGAAT
CTCGCCCGCG GCCGGGTCCA CCCTCGCGCC GCCGGTTTTG CTCTGGACGT GGCGTCGAAG
CGGCTCGATC TACCGGCGCT GGTGCAGCAC CTCGCCCGCG ACCCCTGGCC CTGCGAGGCC
ATCGCCGGGC TGGAGTCGAA GCTGCCACTG TACCGGAACC TCAAGGCGGC ACTGCCGCGT
TACCGGGGAT TGGCCGAGAA CAACGATGTC TCGGCGCTTG CCCTCCCGCC CAAGCTCAGC
CCGGGCGACC GCCACGGGGA GGTTCCCGCC TTGCGCAAGC GTCTGGCCGC TTTGGGTTTC
CTGTGGCAAG AGTCGTCCTC CAAAGAACCG GAGGTCTATG CCGGCGATCT GGTCGAGGCG
GTCGCGCGGT TCCAGGAGCG CCATGGTCTG GCACCGGACG GGGTGATCGG CAAGGGCACG
CTGGCGGCGC TCAACGTGCC GCCTGCCGCG CGTCTCAGGC AGATCCGGCT GGGGCTGGAG
CGGCTGCGCT GGCTGCCGGA GCGGTTCGAA GGCCCCTTCA TCCTGGTGAA CATCCCCTCC
TTCCGTTTGT ACGGCTACGG CCAGGACCCC GAGCGGCCGG AGGTGTCGAT GAACGTGGTC
GTGGGCCGGT CGTCGGGGGG ACACAACACG CCGGTGTTCC ATTCCGACAT GACCTACGTG
GTGTTCCGCC CCTATTGGAA CCTGCCGCGC GCCATCACGG TCAAGGAGAT GCTGCCCGGC
ATTCTGCGCG ACCCCGGCTA TCTGGCCCGC CACAACCTGG AGCTGGTGCC CAGCTTCGGC
AACGGCTCCC AGGTCTACGA GCCCAGCCTG GAAAGCCTGG AGATGCTGTC GGCCGGCTCG
CTCAAGCTGC GCCAGCGGCC GGGGCCGAAG AACGCGCTGG GGCTGGTCAA GTTCGCTTTT
CCCAACAACG ACAACATCTA CCTGCACGGC ACGCCCAGCG TGAACCTGTT CCAGCGGGCG
CGGCGGGATT TCAGCCACGG CTGCATCCGC GTCCAGGATC CCGTGGGCCT GGCGGAATTC
GTCCTGAAAC GCGAGGGCGA GACCTGGACT CAAGAGCGGA TCGAGGAGGC GATGAACGGC
GCCCAGTCGC GCACGGTCAC GCTGAAGCAG CCGCTGCCGG TCTACATCTA CTACTCGACC
GTGCTGGCCG AGCCGGACGG TACCGTGCGG TTCTTCGAGG ACATCTACGG ACTCGACCGG
GTACTGGAGC AGTTGCTGGA GAAGGGCTTC CCGTATCCCT CCTGA
 
Protein sequence
MGGFPRGRLV ALVAAFLISA DAAGTTPDAA AHIDGLLASG VHPRLRWERF TDFQEPLRAL 
YLAQGSRPLW LDGGRPVKQA SAALECLRMA DDQGLNSSDY DADLLGGWIE KLNDDRAASA
EEVAQFEVAM SLALMRYGSN LARGRVHPRA AGFALDVASK RLDLPALVQH LARDPWPCEA
IAGLESKLPL YRNLKAALPR YRGLAENNDV SALALPPKLS PGDRHGEVPA LRKRLAALGF
LWQESSSKEP EVYAGDLVEA VARFQERHGL APDGVIGKGT LAALNVPPAA RLRQIRLGLE
RLRWLPERFE GPFILVNIPS FRLYGYGQDP ERPEVSMNVV VGRSSGGHNT PVFHSDMTYV
VFRPYWNLPR AITVKEMLPG ILRDPGYLAR HNLELVPSFG NGSQVYEPSL ESLEMLSAGS
LKLRQRPGPK NALGLVKFAF PNNDNIYLHG TPSVNLFQRA RRDFSHGCIR VQDPVGLAEF
VLKREGETWT QERIEEAMNG AQSRTVTLKQ PLPVYIYYST VLAEPDGTVR FFEDIYGLDR
VLEQLLEKGF PYPS