Gene M446_1004 details


Gene Information

Locus tag: M446_1004
Symbol:
ID: 6131317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1122529
End bp: 1124406
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 67%
IMG OID: 641641297
Product: TPR repeat-containing protein
Protein accession: YP_001767970
Protein GI: 170739315
COG category:
COG ID:
TIGRFAM ID:
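
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1122529
END_BP = 1124406
REPORTED_GENE_LENGTH = 1878      # bp
REPORTED_PROTEIN_LENGTH = 625    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, length == REPORTED_GENE_LENGTH)
    print(protein_length(length), protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions, 1124406 - 1122529 + 1 gives 1878 bp, and 1878 / 3 - 1 gives 625 aa, matching the reported values.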


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG TCGTCCAATG GTCCTGGCGG CCCTTCGGCC GCGCTCTCAT CGCGACCACA 
GCGGCCAGCG TATTGTCAGT GGGAAGCGGA GCCATCGCTT GCCCGCTTGG AAGTCACGAC
AGCCCGCGCG TGCGGCTTCT CCACACTCTG CTCCAGCCGA GGCCATCGGG CGCCACAGCC
GAAATTGTGA TCGCGCCGGC CCAGAACCAC GCGCATCCCC GCCCGCCCGC CCGCAACGAA
CTCGGGACAT ATAGCCACGA CCTCCCGAAA GGGGCGGCCG CGCCGACCTC GTCAGAGCCG
CCGCCGCTCT ATGACAACCT TGGTCGGCTC ACTTGGCCGG AGGCCCGCCC TGCACACGCC
GAGGCCGCCG CCTATTTCGA TCAGGCCTAT CGGCTTGCCT GGGCATTCAA TCACGCCGAG
GCTGCCCGGG CGTTCCGGGC GGCGCAAGTG CTCGATCCGA GCTGCGCCAT GTGCTTCTGG
GGCGAGGCCT GGGTGCTCGG CCCGCACATC AACTTCCCGA TCGAGGCCGA CGCGAATGCG
CGAGCACTGG TTGCCCTCGA TGAAGCCAAG CGCTTGGCCC CGTCCTCGGG ACCGGTTGGC
GCGGCGCTCA TCACCGCGCT TGCGAAGCGC TACTCACCCG ATGACAATGT GGATCGCAGG
TCACTCGACC ACGCCTATGC CGACGAGATG AAGGGCGTGC AGGCCCGGTT TCCAGAGAGC
CCGGAGGTTG CGCTGCTCAC GGCAGACGCC CTGATGAACC TGAGCCCGTG GGATTACTGG
ACGGACAACG GCCGGACCCC CAAAGGCGAA GCAAGGCGGA TGATCGAGCT GATCGAAGGC
GTGCTTGGTG AGAGCCAGGT AGGGGCTCTC GTTCCAGCAC CCGATCACCC TGGGGCCATT
CACCTCTACA TCCACGCGGT AGAGGCTTCG GACCGACCCG AACGAGCCGT GCCACATGCC
GAGCGGCTAG CCGACCTGAT GCCGGGCGCC GGACACATCG TGCATATGCC GAGCCACATC
TGGTATCGCG TCGGACGCTG GCGTGAGAGC CTCGACGCGA ACCTGCAGGC CGCCGCCGTC
GACGAGGCGC TAATCCGGCG AGGCGGCGCG AGCCTCCTCT ATTCGGAGGC CTACTACGCC
CACAACGTCC ACTTCCTCCT CGCGTCGGCC ACAATGGGTG GGGATGGGCA GACCGCGCTC
GCCGCGGCCG AGAAGCTCGC CGGAATGGTC TCAGATCGAG CTAAGCGTGA AGTGCCCTGG
TCGCAGCCGA TCGCTGCTGC GCCCTACAGC GCTCATGCGC GGTTCTCGTC CCCAAGCACC
ATTTTGGCCT TGCCAGCCCC CGACGCGAAC TTCCCGCTCG TTCGCGCGAA TTGGCATTAC
GCCCGCGGCG TCGCCCTGGC GCGGCTCGGC CGGGGTGACC AAGCACGGTC GGAAGCCGCG
GAGATCCGAA AGCTGGCCCA GCGGCCGGAG ATCGCCGCCC TCGTGCCTGC CGGCGTTCCG
GCGCCGGACG TCCTCGCCAT CGCCGCGAAG CTGATAGAGG CCAGGGTGGC CCAGAACGCC
CGCGATCATG CCCGCTCGGC TGCCCTGTTC AGGGAGGCTG CGGCGATCCA GGAGTTGCTG
CCCTATATGG AGCCACCTTT TTGGTACTAC CCCGTTCACC AATCGCTTGG TGCCGCGCTT
TTGGCGCAAG GTCGGCTGGA CGAGGCAGAG GCTGCGTTTC GCACGGCGCT CCGGCATTCG
CCCAACAATG GTTGGGCGTC CGCAGGCCTG CTGAGGGTGG CCGAGGCACG GGGCGATAGA
GCCGCCGCGA GCGAGGCGGA ACGGCTGATC AAAAGCAACT GGTTCGGCGG CGATGTGCCG
GCGCTCGACC GGCTCTGA
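
The gene sequence above is laid out in blocks of 10 bases, 60 per row. A minimal sketch of how one might strip that layout and compute a GC fraction to compare against the reported GC content follows. The helper names are hypothetical, and only the first row is pasted in for brevity; the whole sequence would be handled the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, copied as displayed.
    first_row = "ATGATCGACG TCGTCCAATG GTCCTGGCGG CCCTTCGGCC GCGCTCTCAT CGCGACCACA"
    seq = clean_sequence(first_row)
    print(len(seq))                      # one display row holds 60 bases
    print(f"{gc_fraction(seq):.1%}")     # GC fraction of this row only
```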
 
Protein sequence
MIDVVQWSWR PFGRALIATT AASVLSVGSG AIACPLGSHD SPRVRLLHTL LQPRPSGATA 
EIVIAPAQNH AHPRPPARNE LGTYSHDLPK GAAAPTSSEP PPLYDNLGRL TWPEARPAHA
EAAAYFDQAY RLAWAFNHAE AARAFRAAQV LDPSCAMCFW GEAWVLGPHI NFPIEADANA
RALVALDEAK RLAPSSGPVG AALITALAKR YSPDDNVDRR SLDHAYADEM KGVQARFPES
PEVALLTADA LMNLSPWDYW TDNGRTPKGE ARRMIELIEG VLGESQVGAL VPAPDHPGAI
HLYIHAVEAS DRPERAVPHA ERLADLMPGA GHIVHMPSHI WYRVGRWRES LDANLQAAAV
DEALIRRGGA SLLYSEAYYA HNVHFLLASA TMGGDGQTAL AAAEKLAGMV SDRAKREVPW
SQPIAAAPYS AHARFSSPST ILALPAPDAN FPLVRANWHY ARGVALARLG RGDQARSEAA
EIRKLAQRPE IAALVPAGVP APDVLAIAAK LIEARVAQNA RDHARSAALF REAAAIQELL
PYMEPPFWYY PVHQSLGAAL LAQGRLDEAE AAFRTALRHS PNNGWASAGL LRVAEARGDR
AAASEAERLI KSNWFGGDVP ALDRL
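
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the NCBI compact notation; the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The function name and the stop-at-first-stop-codon behaviour are choices made for this illustration.

```python
# Codon table in NCBI compact order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence shown above.
    print(translate("ATGATCGACG TCGTCCAATG GTCCTGGCGG"))  # MIDVVQWSWR
    print(translate("GCGCTCGACC GGCTCTGA"))               # ALDRL
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above.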