Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6111 |
Symbol | |
ID | 6134951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 6720733 |
End bp | 6725409 |
Gene Length | 4677 bp |
Protein Length | 1558 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641646209 |
Product | YD repeat-/RHS repeat-containing protein |
Protein accession | YP_001772821 |
Protein GI | 170744166 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.25246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTAC TTGGAAGCGT GGGGACCGGG CGAAAGTGGA TTGTTCTTCT GGCGGCTTGG CTGATCGTCG GGATCGCATC GACGCGCGTC GTGGCCGCGA GGACCGAGGC TGAGGGGGCC GATGCATTCC GCCTTGCAAT CGTCCAGCTG GCCGAGCCGC TGGTCCGGAC GCGACCGACG ACCGCCGCCG AGGACCAGGA CCTCGCGGCC GCGCTGGCAC GCTTTCGCGC GCGGGCCCAG GTGGACGACC TCGCGGCCCT CGAAGCCTTC CTGGATGCGC ACCCGGATAG CGGCTGGGCG CCCGCGCTCC ATCTCAATGT CGGCCTGACC TATCGCCATT ACGGCTACGT GACGCGGGCC GGGACGGCTT GGCGCGCGGC CTGGCGGCTC GGACGCGCGG CGCAGGATCC GGAGGCGCGG GCGCTCGTGG ACCGCGCCGT CGGGGAGCTG GCGCTCCTCC TCGCCTCCCT CGGGGACAGC GACGCCCTGG CGACCCTGTT CGCGGAGATC GGGGCGAGGC CCGTCACCGG CCCGGCGACG GAGAAAATCC AGGTCGCCCG CGAGACCCTG GATCTCGTCA CCAAGGATCC GCGCCACCTG TTCAATTGCG GGCCCCTCGC CCTGCGAAGC CTCCTCGTCG CGCGTGGGGG CAGCCCGGCG GAGGCGGAGG CGCTGCGCTG GCTCCGGGTC GGCCCGAACG GCACGAGTCT GGCGGAGGTC TCGGCGCTGG CGAGCAAGGC GGGGGCCCCG CATCGGCTCG TGCGGCGCGA GCCGGGCCAG CCGGTCCCGG TGCCCTCCGT GATGCACTTC AGGGCCGGGC ACTTCGCGGC CATCGTGGCG GCGGAGAACG GCCGCTTCCA CCTGCACGAT CCCGTCCTGG GCGGGCAGGA GCTGTGGCTC ACGACCGGCG CCGTGGATGC CGAGGCCAGC GGGTACTTCC TGGTGCCCGA CGGCGCGGCG AAGACCGCCG GCCAGGATTG GAGGGAGGTC GCCCCCGCCG AAGCCGGTCA GGTCTGGGGC AAGGGACCGA CGAACGGCGT CGTCCCGGGC GATGCCGGGG ATCCCCTGGC CAGCGCGCCG AGCCCGAATT GCGGGATGTG CGGCACCAAC ATCAAGGAGG CCACCGTCGG CCTCACCCTG TCGGATATGC CGGTCGGCTA CGTCCCGGCC ATCGGACCGG AGGTGCGCAC CCGCATCAGC TACAACCAGC GGGAGGATAG CCAGCCTTCG GTGTTCGGCT TCTTCAACCT CGGTCCGAAA TGGACCATCA ACTGGTTGAG CTACGTCTCG GACGACCCGC AGAACCCGGG CGGGAACGTG TCCCGGGTGA TCGCGACGGG GGGCGCCTAC TTCTACACGG GGTATTCCGG AACGACCGGC GCGTTCGCGC CCCAGAGCGA CGACGGCTCG GTTCTCACCC GGGCGGCGGA ATCTCCGATC GCGTATCGCC GCAGGCTGCG CGACGGGACG ACAGAGATCT ACGCCGAATC GGACGGCAGC ACCGGCTTCC CGCGCCGCAT CTTCCTCAGC AAGGTGGTGG ACCCGCAGGG CAACGCGCTC AGCTTCACGT ACGATGCGCA GCTGCGCCTG ACGGCGCTGA CCGACGCGGT CGGGCGGCAG ACGCGTTTCG GCTACGGGCA GCCGGGCCGG CCGCTGCTGC TCACCCAGAT CACCGACCCG TTCGGCCGCA GCGCCTCCCT GTCCTACGAC GGGCTCGGAC GGCTGAGCGC CATCACGGAC GTGATCGGGC TGACCTCGCG CTTCACCTAC GACGCCAACG GCCTCGTGAC CGCGCTGACG ACGCCCTACG GCACGACCCG CTTCGCCTTC ACGGCCCCCG GCACCAGCGC GCCCCCGCGC TTCGTGGACG TGACCGATCC CCTCGGCTTC CACGAACGCG AGGAGTGGCT CGAACCCGCC CCGATCCCCG ACAGCGAGCC GGCCGCGACC GTGCCGCAGG GCATGCCGGT GGCGCCGGCC AACCAGTACC TCACCTACCG GAACAGCTTC CACTGGGACA AGGACGCCTA CGAGCAGGCC GGATGCACGC CCGCGGGTGG GTGCGACTAC GCGAAGGCCC GCCTCCGTCA CTTCGTGCAT GCCGGCAACG GCATCAAGGG CGTGGCCATC GAGAGCGAGA AGCAGCCGCT CGAGAACCGG GTCTGGTACA ATTACCCGGG ACAGACGGAC GCGATTTCCA GTGGTGCCTT CAACCGCCCG AGCGCGGTGG CGCGCGTGCT CGACGACGGC ACGACGCAGC TGACGCGCTT CGATTACGAC ACGACGGGCT ACAACCTGTC CCGCATCGTC GATCCGCTCG GCCGCACGAC GCACCTCACC TACGCCCCGA ACGGAATCGA CCTCCTGGCG GTCACCCAGG TCTCGGGGCC GGGGAGCTTC GCGCCGCTCG CGCAATTCAC CTATGACGGA CAGCATCGTC CGATCGTCCA CACCGACGCG GCCGGCGCGA CCTCGACCTT CGCGTACAAT GCGGCCGGGC AGCTGACCGC GGCGACGAAC GCCCTGAACG AGACCACCCG CTTCACCTAC GATCCGGCGG GCAACCTGCT CAGCGTCGTC AACGCCACCG GGGCGAGTGC CGCGACCTTC ACGTACGACG CGGCCGACCG GGTCCGGACC TACACCGATT CCGAGGGCTG GACCGTCACC TACGCGTACG ACGCGGCCGA CCGGATCACC GCGATCACCT ATCCGGACGG GACCTCCCAG ACCTACAGCT ACGACCGGCT CGACCTCGTT GGATACCGGG ACCGGCAGTT CCGGACCTGG ACCTACGGCT ACGATGCCAA CCGGCGGCTG ACCAGCCTGG TCGATCCGGG CGGCAGCCGG ATGCTGTTCG GCTATACCGG CCAGGGCCGG CTGGCCAGCC TGACCGACGC CAAGGGCGCC GTCACCCGCT GGAGCTACGA CGTCCAGGGG CGCCCGATCG CCAAGCGCTA CGCGGATGGC AGCACGGTCA CCTCCGCCTA CGAGACCGCG ACGAGCCGCC TCAAATCGGT CACCGACGCG CTCTCCCAGG TGAAGCAGTA CAGCTACGCG CGGGACGACC GGCTGTCGGG GATCAGCTAC CCCAACGCCC TGAACGCCAC GCCGGCCGTG ACCTTCGCGG ACGATCCCTT CTTCCCCCGC ACGCTGTCGA TGAGCGACGG CACCGGCACG ACCCGCTACA GCTACGCGCC GAGCTTCGCG CCGGGAGCGC AGCAACTCGC CCGGGAATGC CTCGTCCCAG CCGGCGCCGG CGACTGCGCC TGGAGCATCG CCTACACCTA CGACGCGCTC GGCCGGATGA TCGCGCGCAG CGTGTCGGGC TCCGGGCCGG AGACCGTCGC CTACGACGCG CTCGGCCGCA TCACCCGCCA CGGCAGCGAC CTCGGCTCCT TCGCCCTGTC CTATCTGGGC CAGACCGAGC AGCCCACCCT GCGCCAGCTC CTGCCGGCGG GCGCGAGCCT CGCCACCACC TGGAGCTACC TGCCCAATGC CGGCGACCGG CGCCTCGCCG GGATCGCCCA TACGGGCCTC GCCCCCGGCC AGTTCCTCAC GCTCGCATTC ACCTCCGCGC CCGAGACCCT GATCACCGGG GTCACCGAGA CGAGCGACGT CCCCTCCCCG GTCCCGCCGG CGGGCAGTCA GACGGCACAG TACAACGTCC TCAACCAGCT CACCGGCCTC TCGGGGCAGG CCCTGACCTA CGACGCCAAC GGCAACCTCG TCTCGGACGG GCCACGCGTC TTCGCCTGGG ACGCCGAGGA CCGGCTGGTG CGCATCACCT ATCCGGCCCA GCCCGGCAAG GTGAGCAGCT TCACGTATGA CGGGCTGGGT CGGCGGGTGA CGATCGGCAG CACGCCGGCG GGCGGCGCGG CGGCAAGCGT AACGGCGTAT GTCTGGTGCG GCGAGAGCCT GTGCCAGTCC CGGACCCAGG CGGGCGCGGT GCTGCGGTCG TACCTGGCGG AGGGCGAGGT GGCGGCGGGC GCGCCGGGGC AGCGCTTCTA TTACGGGATC GACCAGATCG GCTCGGTGCG GCGGGCCTTC GCGAGTGCGG GCGGCGCCCC GGCCTTCGCC TACGATCCTT ACGGGAACGC GCTTCAGGGC GGCGCGCCGG TCACGGATTT CGGGTTCGCC GGCATGTTCT ATCATGCCGA GAGTGGTCTA TATCTGACAC ACTTCCGAGC TTACGATCCA GCTATCGGCC GGTGGATCTC GAGAGACCCT ATCGGAGAGC TTGCCGATGT TACAAGCCAT CCCCAATCTG ATGACGCATT TTTCAACTTC AGGCCTTTCG ATGGCCAGTC CTTATCAACG CTTCGGCTTA GTGCGGGCGC CAAAAGACCT CACTGGCCGG CACTCTCAAC CTTGGAAGCG AACCGAGGGA GGCTCCTCGC TCCAAGTGCG CTCTCGCTTT ACGACGGACT GAGCCTATAC AGTTACGTTG CGCAAAATCC GCTGGCGCGC ACGGACCCGC AGGGCCTCTC CGGTTTTCGG TGTGATGGAT TTTCCGCAGG ATGCCAGAGT GGCGGCACAT ACGGTACAAC CGCCATGTAC TGCGTCAGGG GGAGAAAGCT CTGCTTCGAC TGTGCAGTCA AATATCTTGG GTTGGATGGA GAGCCAAATC ATGTGAAATT GAAAGCTCTT GAGCGGTATC TAATTGAGGA AGATTAG
|
Protein sequence | MSVLGSVGTG RKWIVLLAAW LIVGIASTRV VAARTEAEGA DAFRLAIVQL AEPLVRTRPT TAAEDQDLAA ALARFRARAQ VDDLAALEAF LDAHPDSGWA PALHLNVGLT YRHYGYVTRA GTAWRAAWRL GRAAQDPEAR ALVDRAVGEL ALLLASLGDS DALATLFAEI GARPVTGPAT EKIQVARETL DLVTKDPRHL FNCGPLALRS LLVARGGSPA EAEALRWLRV GPNGTSLAEV SALASKAGAP HRLVRREPGQ PVPVPSVMHF RAGHFAAIVA AENGRFHLHD PVLGGQELWL TTGAVDAEAS GYFLVPDGAA KTAGQDWREV APAEAGQVWG KGPTNGVVPG DAGDPLASAP SPNCGMCGTN IKEATVGLTL SDMPVGYVPA IGPEVRTRIS YNQREDSQPS VFGFFNLGPK WTINWLSYVS DDPQNPGGNV SRVIATGGAY FYTGYSGTTG AFAPQSDDGS VLTRAAESPI AYRRRLRDGT TEIYAESDGS TGFPRRIFLS KVVDPQGNAL SFTYDAQLRL TALTDAVGRQ TRFGYGQPGR PLLLTQITDP FGRSASLSYD GLGRLSAITD VIGLTSRFTY DANGLVTALT TPYGTTRFAF TAPGTSAPPR FVDVTDPLGF HEREEWLEPA PIPDSEPAAT VPQGMPVAPA NQYLTYRNSF HWDKDAYEQA GCTPAGGCDY AKARLRHFVH AGNGIKGVAI ESEKQPLENR VWYNYPGQTD AISSGAFNRP SAVARVLDDG TTQLTRFDYD TTGYNLSRIV DPLGRTTHLT YAPNGIDLLA VTQVSGPGSF APLAQFTYDG QHRPIVHTDA AGATSTFAYN AAGQLTAATN ALNETTRFTY DPAGNLLSVV NATGASAATF TYDAADRVRT YTDSEGWTVT YAYDAADRIT AITYPDGTSQ TYSYDRLDLV GYRDRQFRTW TYGYDANRRL TSLVDPGGSR MLFGYTGQGR LASLTDAKGA VTRWSYDVQG RPIAKRYADG STVTSAYETA TSRLKSVTDA LSQVKQYSYA RDDRLSGISY PNALNATPAV TFADDPFFPR TLSMSDGTGT TRYSYAPSFA PGAQQLAREC LVPAGAGDCA WSIAYTYDAL GRMIARSVSG SGPETVAYDA LGRITRHGSD LGSFALSYLG QTEQPTLRQL LPAGASLATT WSYLPNAGDR RLAGIAHTGL APGQFLTLAF TSAPETLITG VTETSDVPSP VPPAGSQTAQ YNVLNQLTGL SGQALTYDAN GNLVSDGPRV FAWDAEDRLV RITYPAQPGK VSSFTYDGLG RRVTIGSTPA GGAAASVTAY VWCGESLCQS RTQAGAVLRS YLAEGEVAAG APGQRFYYGI DQIGSVRRAF ASAGGAPAFA YDPYGNALQG GAPVTDFGFA GMFYHAESGL YLTHFRAYDP AIGRWISRDP IGELADVTSH PQSDDAFFNF RPFDGQSLST LRLSAGAKRP HWPALSTLEA NRGRLLAPSA LSLYDGLSLY SYVAQNPLAR TDPQGLSGFR CDGFSAGCQS GGTYGTTAMY CVRGRKLCFD CAVKYLGLDG EPNHVKLKAL ERYLIEED
|
| |