Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_4162 |
Symbol | |
ID | 7301886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 4218922 |
End bp | 4220244 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643601816 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002499343 |
Protein GI | 220924041 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTTTTT CCGACAGTCC TTGGGGAACG GGCCTTGGCC TGTTCGCCGT CGTGTTCCTG GTCTTCGCGA ACGGGTTCTT CGTCGCGGCC GAGTTCGCCC TCGTGGCGGT GCGGCGCAGC CGGGTGCAGG AACTGGTGGC GGAGAAGCGA GCCAACGCCG CCGCTCTCCA GCGGGCGACC GACCATCTCG ACGCGCATCT CGCGGCGACG CAGCTCGGCA TCACCATCTC GTCGCTCGCC CTCGGCTGGG TCGGCGAGCC GGCTCTGGCC CACCTGATCG AGCCGCTGCT CGCATGGCTG CCGCCGCCCC TGGGAGCGGC GAGCGCCCAC GCGATCTCGG TCGTCGTGGC GTTCGTGGTG ATCACGGCCC TGCACATCGT GCTCGGCGAA CTCGCGCCCA AGAGCCTCGC GCTCCAGCGC AGCGAGCGCA CCGCCCTGGC CGTGGTCCGG CCCCTCCGGC TGTTCCTGCT CCTGTTCCGT CCGGCGATCG CCTTCCTCAA CGGGCTCGGG AACGGCGTCC TGCGCCTGTT CGGTCTCCAG CCCGGCTCGG GCGAGGATTC GCTGCATTCC CCGGCCGAGC TCACCCTCCT CGTCGCGGCG AGCCAGGAGG CCGGCCTGAT CCAGGAGGCG CAGCAGGAGG CGGTCGCGCG CATCTTCGGC ATCGGCGAGC GGCGCATCCG GGACATCATG ACCCCGCGCC ACGAGGTCGA CTGGGTCGAC ATCGAGGAGC CTCGGGAGGC GATCCTCGAA ACCGTGCGGG CCTGCCGTCA CGAGGCGCTG GTGGCGAGTC GGGGCGAGAT CCACGAGATC GTCGGCGTCC TGCGCAAGCA GGACATCCTC AACCAGATCC TCGACGGGAC CTCCGTCGAC ATCGCGCCCC TGATCCGCGA GCCGATCGTG GTGCATGAGG GGATGCCGAT CCTGCGCGTG CTCGAGACCT TCAAGGCCAA GCCGGTACGC ATGGCGATCG TCGTGGACGA ATACGGCAAC CTCGAAGGCA TCGTCACCCA GACCGACCTT CTGGAGGCGA TCGCAGGCGA CATTCCGGAC GCAGAGGACG AGGAGCCGAT GGTGGTGGAG CGGCAGGACG GCTCGCTCCT GATCGACGGC ATGATGCCGG CGGTCGAAGC CTTCGAGCGC CTGGGCTTCA GCAATCCGCC GGACACGGAC GATTATTCCA CCCTCGCCGG ATACGTGATC TCCGAACTCG GCCGCATCCC GTCGGCGGGA GACGCCTTCG AGCGGCAGGG CTGGCGCTTC GAGGTCATCG ACATGGACGG TCGCCGCGTC GACAAGATCC TGGCCGAGCG CGCACCCGCC TGA
|
Protein sequence | MIFSDSPWGT GLGLFAVVFL VFANGFFVAA EFALVAVRRS RVQELVAEKR ANAAALQRAT DHLDAHLAAT QLGITISSLA LGWVGEPALA HLIEPLLAWL PPPLGAASAH AISVVVAFVV ITALHIVLGE LAPKSLALQR SERTALAVVR PLRLFLLLFR PAIAFLNGLG NGVLRLFGLQ PGSGEDSLHS PAELTLLVAA SQEAGLIQEA QQEAVARIFG IGERRIRDIM TPRHEVDWVD IEEPREAILE TVRACRHEAL VASRGEIHEI VGVLRKQDIL NQILDGTSVD IAPLIREPIV VHEGMPILRV LETFKAKPVR MAIVVDEYGN LEGIVTQTDL LEAIAGDIPD AEDEEPMVVE RQDGSLLIDG MMPAVEAFER LGFSNPPDTD DYSTLAGYVI SELGRIPSAG DAFERQGWRF EVIDMDGRRV DKILAERAPA
|
| |