Gene M446_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5088 
Symbol 
ID6130034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5582571 
End bp5584463 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content69% 
IMG OID641645223 
Producthypothetical protein 
Protein accessionYP_001771848 
Protein GI170743193 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.937332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCA TCACCCTCTC GGCTGCGACG CGTCAGAACC TGCTGTCGCT GCAGGACACG 
GCGAGCCTGC TCGCCACCAC CCAGAACCGC CTCGCGACCG GCAAGAAGGT CAACACGGCC
CTCGACAACC CGACCAACTA CTTCACGGCC GACGGGCTGA CGGCGCGGTC GGCCGGGCTC
AGCGCGCTCC TCGACGGGGT CTCGAACGGC ATCCAGGTGA TCCAGGCCGC CAATACCGGC
ATCACCAAGC TGAAGAGCCT CACCGAACAG CTGAAATCGG TGGCCCAGCA GGCGCTCTCC
TCCGCCAATG CCTTCTCGGC CAAGGCCAGC GTGGCCTCGA CCACCCTGAC CGGGGCCACG
AGCACGAACC TGCTCTCGAT CGGCCCGACC GCGGCGGCGA GCGATGCCGC GATCGGCTCG
CTGCCGCCCG CCACGGCCGT GCAGCTGACC ATGACGGGCA GCGTCGATCT CAGCAATACG
GCCGCGATCC AGGCGGCGTT GTTCACGTCG AGCAACAGCC TGACGGTGAC CATCGACGGG
GTTCAGGTCA CGGTGAACAA GGGGATCGAC GACGCCAGCC CGACGGCTTT GGCCGCCTCG
ATCACGGCCC AGCTCAAGGC CGCCGGATCG TCGATCACGG TCGCGCCGAG CGGCGCCACC
CCCAACACGC TGGTCGCCAA GGGCACGGCG GACGGTGCGC CCTTCAGCAT CGGGACCGAC
GCCGCGACCA CCAAGCTGTT CGGCACGGTC ACGCAGTCGA GCAACGTCTT CGTGCCGACG
GCGACGACGC TCGCCACCGG CCTCGGCTTT CAGGTCGGGG ATTCCTTCAC GGTGAACGGC
CGCTCGGTGA CGATCGCCCG CAACGACACC CTGGCCTCCC TCGCCCAGAA GGTCGGCACC
GCCACCGGCG GCGCCGTGAC CGCGGCCTAC GACGCGGTGA GCCAGAAATT CGTCTTCACG
GCGGCGGATC CCGCCACCAC CATCGCCCTC GGCGACGGCG GCACCGCCAC CGGCAAGGTC
GCCAATCTCG GCTTCAGCAC CACGAGCTTC GGGGCCGGCC TCGGGGCCGG TTCCGGCACG
AACTTCAAGC AGAGCCCGCT CAACGGCCAG ACCATCACGG TGCGGGTGGC GAACGGCACC
GGCGTGACCC TGACCTTCGG CCCCGCCGCG GGCCAGATCT CGACCCTGAC GCAGCTCAAC
GCGGCGCTCG CCCCGGCCAA CGCGCAGGCG ACCCTCGATC CCACGACGGG GCAGATCAAG
CTCACCACCA CCAACGAGGC CGGCGCCGAC AGCCTGACGT TGATCGCCTC GCCGCCCCCG
ACCTCGAACA ACGTCGCCAA CCCGTTCAAC ACCGGCACGG CGATCGCCAC GATCGGCGGC
GACGGCCTCA CGGCGCGCAA CGGCCTCGTG ACGACCTATA ACGGCCTGCT CACCCAGATC
GACCAACTCG CCGCCGATGC GAGCTTCAAC GGCGTCAACC TGCTCGCCGG GGACAACCTG
ACGCTCAACT TCAACGAGCG CAGCACGTCG CAGCTCTCCG TGACCGGGGT CAACGTGTCC
GCGGCGAGCC TCGGCCTGAC GCCGGTCGGC CAGGCCGACT TCGTGGAGAG CATCGCGATC
AACAAGGTGC TCGCCACCAT CAACTCGGCC GCCAGCGCGC TGAAGAACCA GGCCGCGTCG
CTCGGCGCGA ACCTCGCCGT GGTGCAGAAC CGGCAGGACT TCACCAAGCA GCTCATCACC
GTGCTCGACA CCGGCGCGGC GAACCTGACC AACGCGGACA TGAACGAGGA AGCGGCGAAT
TCGCAGGCGC TGCAGACCCG CACCTCGCTC GGCACGTCGG CGCTGTCGCT CGCCAACCAG
GCGCAGCAGG CGATCCTCCA GCTCCTGCGC TGA
 
Protein sequence
MSSITLSAAT RQNLLSLQDT ASLLATTQNR LATGKKVNTA LDNPTNYFTA DGLTARSAGL 
SALLDGVSNG IQVIQAANTG ITKLKSLTEQ LKSVAQQALS SANAFSAKAS VASTTLTGAT
STNLLSIGPT AAASDAAIGS LPPATAVQLT MTGSVDLSNT AAIQAALFTS SNSLTVTIDG
VQVTVNKGID DASPTALAAS ITAQLKAAGS SITVAPSGAT PNTLVAKGTA DGAPFSIGTD
AATTKLFGTV TQSSNVFVPT ATTLATGLGF QVGDSFTVNG RSVTIARNDT LASLAQKVGT
ATGGAVTAAY DAVSQKFVFT AADPATTIAL GDGGTATGKV ANLGFSTTSF GAGLGAGSGT
NFKQSPLNGQ TITVRVANGT GVTLTFGPAA GQISTLTQLN AALAPANAQA TLDPTTGQIK
LTTTNEAGAD SLTLIASPPP TSNNVANPFN TGTAIATIGG DGLTARNGLV TTYNGLLTQI
DQLAADASFN GVNLLAGDNL TLNFNERSTS QLSVTGVNVS AASLGLTPVG QADFVESIAI
NKVLATINSA ASALKNQAAS LGANLAVVQN RQDFTKQLIT VLDTGAANLT NADMNEEAAN
SQALQTRTSL GTSALSLANQ AQQAILQLLR