Gene Mchl_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2554 
Symbol 
ID7117301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2683977 
End bp2685155 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID643525302 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002421324 
Protein GI218530508 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.311361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.38185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGGC GGATTGTCGT TGTCGCGGGG CTGCTTTTTG CCCCACTCTC CGCCCTGGCC 
GTCGAGACCG CGCCGACCGA AGGCCAGGAC TACGCCGCCG CCAAAGGCTG GTACGAGAAG
GCCGCCGCCG CGGGTGACGC CACCGCCATG CACAAGCTCG GCCTTCTCTA CGAGGAGGGC
CAGGGCGTCG CGCAGGATTA TGCCGCCGCC CGAGGCTGGT ACGAGAAGGC CGCCGCCAAG
GGGTTGGCGG AGTCGATGTA CAATCTCGGC ATTCTCGACG AGTTCGGCCG GGGCGTGGCG
CAGGACTACC CGGCCGCCAA GGGCTGGTAC GACAAGGCGG CCGCCGCGGG TGATGCGGAC
GCCATGCAGA AGCTCGGCTA CTTCTACGAT GTCGGCCAGG GCGTGCCGCA GGACTATGCC
GCGGCCAAGG ACTGGTACGA GAAGGCGGCG GCCGGGGGCA GCGCCAGCGC CATGAACAAT
CTCGGCGTGC TGTACGAGAA CGGGCAGGGC GTGAAGCAGG ACTATGCCCG CGCCAAGACC
TGGTACGAGA AGGCCGCCGC CGCCGACACG GGCGACGCCA TGCGCAGCAT TGGGCGTCTG
TATCTCAATG GCCTGGGCGT GACGCAGGAT TACGCCGCGG CCAAGGGCTG GTTCGAGAAG
GCCGCGAGCG CGGGCAGCGC GGAGGCCATG AACGATCTCG GCCTCGTCTA CGAGGACGGG
CAGGGCGTTG CGAAAGACGA TGCCGCCGCC AAGGGCTGGT ACGAGAAGGC CGCCGAGGCG
GGCAACCCGT TCGCCATGAC CAATCTCGGC TCTCTGTACG AGAACGGACA GGGCGTGAAG
CAGGACTACG CCACGGCCAA GCTCTGGTAT GAAAAGGCCG CTGCCGCGGG CAATGCCCAG
TCCATGTACA ATCTCGGTGC CCTGTACGAG AACGGCCAGG GCGTGAAAAA GGACTACGGA
GCGGCCAAGC TCTGGTACGA GAAGGCGGCC GATGCCGGGA GTTCGGAGGG CATGTCCGCG
CTCGGCACCC TCTACGCCGA GGGGTGGGGT GTGGCGCGCG ACCGGAGCGC CGCCAAGCTC
TGGTATGAGA AGGCCGCCGC CCTCGGCGAC ACGGGGGCGA TGCAGAAGAT CGCCGCCCTG
TTCGAGAAGG GCACGGGCAA AGCGGGCGCC AAACGCTAG
 
Protein sequence
MLGRIVVVAG LLFAPLSALA VETAPTEGQD YAAAKGWYEK AAAAGDATAM HKLGLLYEEG 
QGVAQDYAAA RGWYEKAAAK GLAESMYNLG ILDEFGRGVA QDYPAAKGWY DKAAAAGDAD
AMQKLGYFYD VGQGVPQDYA AAKDWYEKAA AGGSASAMNN LGVLYENGQG VKQDYARAKT
WYEKAAAADT GDAMRSIGRL YLNGLGVTQD YAAAKGWFEK AASAGSAEAM NDLGLVYEDG
QGVAKDDAAA KGWYEKAAEA GNPFAMTNLG SLYENGQGVK QDYATAKLWY EKAAAAGNAQ
SMYNLGALYE NGQGVKKDYG AAKLWYEKAA DAGSSEGMSA LGTLYAEGWG VARDRSAAKL
WYEKAAALGD TGAMQKIAAL FEKGTGKAGA KR