Gene Mchl_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4040 
Symbol 
ID7118045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4253061 
End bp4254962 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content74% 
IMG OID643526759 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_002422768 
Protein GI218531952 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.206546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.345499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCT TCGTGCCCCT GACCACGGTG TTTCCCGGCC TCGTCTGGCT GATGGCGGCG 
CTCGCCCTCG TCCAGGGCTT GCGCCGTGCC GCCCTCTGGC GCGTCGGCGC GGCGGCGCCC
GTGGCGTGGC TCGACGGGCT CGCCAAGCTG CCCCGGCGCT ACCTCGTCGA TGTCCACCAC
GTCGTGGCCC GCGACGCCTA CGCCTCGCGC ATGCACGCGG TCGTGGCCGG CGGCCTGATC
GCGGCCTCGA TCCTGACCGC GCTGGCGATC CTGCCGCCGC TGGCCGATTT CCGGCCCTAC
TGGTTCCTCG TGGCGCTCGC CTTCGGCGTG ACCGCGATCG GCTCGCTGCT CGTCGGCGCC
CGGCGCTACC CGGCCAAGCC GAAGCGCCTC TCTGCCGGCC GCTTCCAGAT CCTGCCGTTC
CTCCTCGTCG CCTACGCCGT CGGCGGCACG ATCACCGCCC TCATCCTCGC GCTCGGCGGC
GCCGGCGGCG TGTTCGGTTC CGTCGCGCTT GCGCTCGCGG CGGCCGGCGG GCTCGGCCTC
GCCTTCGAGG TGCGCCACGG CCCGATGCGC CACGCGGCGG CCGGCGCGCT CCACCTCGTC
GCCCATCCCC GGCCCGGCCG GTTCGAGGGC CGCCCCGACA CCGCGCTCCA GCCGCTCGAC
CTCGACGCGC CCCGCCTCGG CTCGGAGATG CCGGCCGACT TCACCTGGAA CCGGCTGCTG
TCCTACGACG CCTGCGTGAG CTGCGGGCGC TGTGAGACCG CCTGCCCGGC TTTCGCCGCC
GGCCAACCGC TAAATCCGAA GAAGCTGATA CAGGATCTCG TCGCCGGCCT CTCCCCCGCC
GAGCCCGCCT ATGCCGGCAA CCCCTATCCC GGCGGCCGGG CGGCGGAGGG CGCGCGCGGG
GCTTTGGCCC GGCTGGTCGG GCCGGATGCG CGCATCCATC CCGACACGCT GTGGTCCTGC
ACCACCTGCC GCGCTTGCGT CGAGGAATGC CCGATGATGA TCGAGCATGT CGATGCCGTG
GTCTCCCTGC GCCGTCACGA GACGCTGGAG CGCGGAGCGC TGCCCGAGAA GGCCGTCGTC
CCGGTGACGG AGCTGCGCCA GTCCGGCGAT CCCGGCGGGC GCCCGCTGGC CTCGCGCACC
GATTTCGCCG CCGGGCTCGA CCTGCCGCGG ATCGCGGACC GTGAGCCCGT CGATGTCCTG
CTCTGGCTCG GCGAGGGCGC CTACGATCTG CGCTACGGCC GCTCTTTGCG GGCGCTGATC
CGGCTCCTGC GCGAGGCCGA GGTCGACTTC GCGGTGCTGG GCGCGGAGGA GCGCGACACC
GGCGACCTCG CCCGGCGGCT CGGCGACGAG GCGACCTTCC AGGCGCTGGC GCGGGAGAAC
ATCGCGACGC TGGCGAAGTA CCGCTTCAAG CGGATCATCA CCGCCGACCC GCACGCGCTG
CATGCCCTGC GCAACGAGTA CCCGGCCTTC GGTGGCCACT ACACCGTGAC CCACCACACC
GCCCTCCTGC TGGAGCTGAT CCGGGCCGGC AAGCTCAATC CGGGCCGTCT GCCCGACCTC
TCGGTGACCT ATCACGACCC CTGCTACCTC GCCCGCTACA ACGGGGAGAC CGAGGCGCCC
CGCGCGGTGC TCGACGCGAT CGGCGTCACC CGCCGCGAGA TGACCCGGTC CGGGCGCCGG
GCGATGTGCT GCGGCGGCGG CGGCGGCGCG CCGGTGAGCG ACGTGCCGGG CGAGCGCCGC
ATCCCCGACA TCCGCATGGC TCAGGCCGCC GAGACCGGGG CCGGGATCGT CGCCGTCGCC
TGCCCCTCCT GCACGGCGAT GCTGGAGGGC GTGACCGACC GCAAGGCGGA GATCCGCGAC
GTGGCCGAAC TCCTGCTCCA GGCGGTGGAG GCGGGCCGAT GA
 
Protein sequence
MTSFVPLTTV FPGLVWLMAA LALVQGLRRA ALWRVGAAAP VAWLDGLAKL PRRYLVDVHH 
VVARDAYASR MHAVVAGGLI AASILTALAI LPPLADFRPY WFLVALAFGV TAIGSLLVGA
RRYPAKPKRL SAGRFQILPF LLVAYAVGGT ITALILALGG AGGVFGSVAL ALAAAGGLGL
AFEVRHGPMR HAAAGALHLV AHPRPGRFEG RPDTALQPLD LDAPRLGSEM PADFTWNRLL
SYDACVSCGR CETACPAFAA GQPLNPKKLI QDLVAGLSPA EPAYAGNPYP GGRAAEGARG
ALARLVGPDA RIHPDTLWSC TTCRACVEEC PMMIEHVDAV VSLRRHETLE RGALPEKAVV
PVTELRQSGD PGGRPLASRT DFAAGLDLPR IADREPVDVL LWLGEGAYDL RYGRSLRALI
RLLREAEVDF AVLGAEERDT GDLARRLGDE ATFQALAREN IATLAKYRFK RIITADPHAL
HALRNEYPAF GGHYTVTHHT ALLLELIRAG KLNPGRLPDL SVTYHDPCYL ARYNGETEAP
RAVLDAIGVT RREMTRSGRR AMCCGGGGGA PVSDVPGERR IPDIRMAQAA ETGAGIVAVA
CPSCTAMLEG VTDRKAEIRD VAELLLQAVE AGR