Gene Mchl_5071 details


Gene Information

Locus tag: Mchl_5071
Symbol:
ID: 7118945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 5428098
End bp: 5429495
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 68%
IMG OID: 643527765
Product: triple helix repeat-containing collagen
Protein accession: YP_002423764
Protein GI: 218532948
COG category:
COG ID:
TIGRFAM ID:
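
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(5428098, 5429495)
print(length, protein_length(length))  # prints "1398 465"
```

Both results agree with the Gene Length (1398 bp) and Protein Length (465 aa) fields in the record.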


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0278582
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.141411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTGC CGATCAATTT TAATAATACG GTCTATGCTC CTGGATTCCT TGGTACCGAT 
GACGGCGGGG CATCTGGTAA TTTCCAAGTC GATGCTGCCT CCACCTACTC TGTCAGCGTC
ACGGGGACGG TCAACGGACC GGGTAGCACG GTCAGCTTGA CTTACGGGGC CGACGCGCCA
GCCGCATTTG CCGGAAAATC GGTGCAGCTC ACTGCAACCC AATTCGACAA TTCGAATATG
ATATTGTTTA CCAGCAATGC CATTCCTCCC GGAGAGACGG ACCAGGGGAA TTATCGATAC
ATTCTGTCGA ATACCCAGTT GTACGGATCA AATCCTCCGG CCGGGACAAC TCGGACGCGC
TTTACAGCGG ACACCAATAA TGCCCTCGGT GATTACGCAG TAGCGGCGAC GGGCGGCACG
GGAGCAACGG GCGGCACCGG AGCCACGGGC GGCACGGGAG CAACGGGCGG CACCGGAGCC
ACGGGCGGCA CGGGAGCAAC GGGCGGCACC GGAGCCACGG GCGGCACGGG AGCAACGGGC
GGCACCGGAG CCACGGGCGG CACCGGAGCA ACGGGCGGCA CCGGAGCCAC GGGCGGCACC
GGAGCCACGG GCGGCACGGG AGCAACGGGC GGCACGGGAG CAACGGGCGG CACGGGAGCA
ACGGGCGGCA CGGGAGCAAC GGGCGGCACG GGAGCCACCG GCGGAACCGG AGCGACCGGT
GGCACAGGAG CCACGGGCAG CACCGGGGCC ACCGGCGGTA CCGGTCCCGT CATCTGCTTC
ACGCCCGGCA CCCGCATCGC GACGCCGGAT GGCGAGCGCG CGATCGAGCA CCTGCAGCCC
GGCGATGTCG TGAGCCTCGC CGACGGCGCC GTCGCCACCG TACGCTGGAT CGGCCGTCGC
TTCCTCGATC TGCGGACGCA TCCGCAGCCC ACCACCGCTC ACCCCGTGCG GATCGCCGCC
GGCGCCTTCG GTCAGAGCCT GCCAGTGCGG GACCTCATCG TCTCGCCCGG CCACGGCCTC
TACTGCGACG GCGTTCTCAT CCCCGCGATC TGCCTCGTCA ACGACCGCAC GATCACACGG
GTTGAGGTCA CGTCGGTCGA ATACCTGCAC GTCGAGTTGG AGCGGCATGC ACTCCTACTG
GCCGAGGGGC TGCCGACGGA AAGCTATCTC GACGTGGACA ACCGCGGCTT CTTCGAGAAC
GGCGGAGCGC CGCTGATCCT GCACCCGACC TTCGCGGCGA TGGCGCATGA GGGGGGCTGT
GCGCCCTACG TGATTGCCGG GGCCAAGCTG CGAACGGTGC GGGCGCAACT GGAGCGTCAG
GCCGACATCT GGGAGGCGCA GCGGCAGCCG GGTACCGGCT GGCGGGCACG TCTCGGCCTC
AGCCGCCGCA CCGCGTGA
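
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence block laid out like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; how the source database computes or rounds its value is not stated here. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into 10-base columns."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; pass the full block for the whole gene.
first_row = "ATGGCGCTGC CGATCAATTT TAATAATACG GTCTATGCTC CTGGATTCCT TGGTACCGAT"
print(round(gc_percent(first_row), 1))
```

Because whitespace is stripped before counting, the sequence block can be pasted in as-is without removing the column spacing.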
 
Protein sequence
MALPINFNNT VYAPGFLGTD DGGASGNFQV DAASTYSVSV TGTVNGPGST VSLTYGADAP 
AAFAGKSVQL TATQFDNSNM ILFTSNAIPP GETDQGNYRY ILSNTQLYGS NPPAGTTRTR
FTADTNNALG DYAVAATGGT GATGGTGATG GTGATGGTGA TGGTGATGGT GATGGTGATG
GTGATGGTGA TGGTGATGGT GATGGTGATG GTGATGGTGA TGGTGATGGT GATGGTGATG
GTGATGSTGA TGGTGPVICF TPGTRIATPD GERAIEHLQP GDVVSLADGA VATVRWIGRR
FLDLRTHPQP TTAHPVRIAA GAFGQSLPVR DLIVSPGHGL YCDGVLIPAI CLVNDRTITR
VEVTSVEYLH VELERHALLL AEGLPTESYL DVDNRGFFEN GGAPLILHPT FAAMAHEGGC
APYVIAGAKL RTVRAQLERQ ADIWEAQRQP GTGWRARLGL SRRTA