Gene M446_4370 details


Gene Information

Locus tag: M446_4370
Symbol:
ID: 6133073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4816158
End bp: 4817192
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 78%
IMG OID: 641644509
Product: triple helix repeat-containing collagen
Protein accession: YP_001771147
Protein GI: 170742492
COG category:
COG ID:
TIGRFAM ID:
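
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 4816158
end_bp = 4817192

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not encode an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.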


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.324657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTCC GCCGTGCCGT CCTGGTCCTG TCCACGCTCG TCCTGATCGG ATCGGGGCCG 
GCCTTCGCCC AGGGCGAGGC CGCCCCTGCC CCCGCCGCGC CGAAACCCGA GGCGCCGCGG
GGAGCGGCCC GAAAGCCCGC CTCCTCGGCG ATCCAGATCT GGGACGCGCG GATCGAGGGC
GGCGACCTGC GCATCTCCGG CAATGTCGGC AAGGCGGGCG TGACCGTCTC GCTCGACGAC
GAGGTCGCGG TCCAGAGCGA CCGGCGCGGC CGCTTCGCGA TCAAGGTCCC GTACGTTCCG
CAGACCTGCG TGGCGACCCT GACGGCGGGC GAGGAGTCGC GCGAGGTCGC GGTGGCGAAT
TGCGCCCCGC AGGGCCAGCC CGGCCCCGCC GGCCAGCCCG GGCCGACCGG CCCGCAGGGC
GTGGCCGGCC TGCCCGGCCC GAAGGGCGAC CCAGGCCCGC AGGGACCGGC GGGTCCCAAG
GGGGAGCCCG GGCCCAAGGG GGAGCCCGGG CCCAAGGGGG AGCCCGGGCC CAAGGGGGAG
CCCGGGCCCA AGGGTGAGCC CGGGCCCAAG GGTGAGCCCG GGCCCAAGGG TGAGCCCGGA
CCCAAGGGGG AGCCGGGCCC GCGCGGAGAG GCCGGACCTC AGGGCGCGCT GGGGCCCAAG
GGCGAAGCTG GATCAAGGGG CGAACCCGGA CCAAGGGGCG AACCCGGCCC GAAGGGAGAG
GCGGGGCTGG CTGGCGCGCC CGGCCCGAAG GGCGAGGCCG GTCCGCGCGG ACCGCAGGGC
GAGCGCGGAC CCCCGGGCGC GCCCGGCGCG GCGGCCCCCG TCGCCGCGGC GACGGCGCTG
CCGATGCGGG TCCTGCGCAG CGAGACCTGC GCCACCGGCT CCTGCGAACT CGCCTGCGAG
GGCGGCGAGA CGCTGCTCTC GGCCTATTGC GTGCGGGCCG GCGCGCCGAC CTTCACGCGG
CGGGAGGGCG GGCAGGCCGC GGCCTTCTGC CCGTCCGAGA GCGCCGGCAT CGTCGCGGTC
TGCGCGAAGC TCTGA
 
Protein sequence
MRLRRAVLVL STLVLIGSGP AFAQGEAAPA PAAPKPEAPR GAARKPASSA IQIWDARIEG 
GDLRISGNVG KAGVTVSLDD EVAVQSDRRG RFAIKVPYVP QTCVATLTAG EESREVAVAN
CAPQGQPGPA GQPGPTGPQG VAGLPGPKGD PGPQGPAGPK GEPGPKGEPG PKGEPGPKGE
PGPKGEPGPK GEPGPKGEPG PKGEPGPRGE AGPQGALGPK GEAGSRGEPG PRGEPGPKGE
AGLAGAPGPK GEAGPRGPQG ERGPPGAPGA AAPVAAATAL PMRVLRSETC ATGSCELACE
GGETLLSAYC VRAGAPTFTR REGGQAAAFC PSESAGIVAV CAKL
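
Both sequences are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any analysis. The following is a minimal sketch, in Python, of stripping that layout and computing a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in `dna` that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence above, used as a short example input.
first_line = (
    "ATGCGTCTCC GCCGTGCCGT CCTGGTCCTG "
    "TCCACGCTCG TCCTGATCGG ATCGGGGCCG"
)
dna = clean_sequence(first_line)
print(len(dna), "bases, GC fraction", round(gc_fraction(dna), 2))
```

Running the same two helpers over the full gene sequence gives a value that can be compared with the reported GC content.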