Gene M446_3487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3487 
Symbol 
ID6129324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3891684 
End bp3894230 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content77% 
IMG OID641643658 
Producttriple helix repeat-containing collagen 
Protein accessionYP_001770306 
Protein GI170741651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.18885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTACG GAACGATCAC GGTAACGGCA AATTTCGTAC AGCCCCTAAC AAATGCAGTT 
TCGAATTCAG GCATCGTACT TCTTTATGAC ACGACCGGCA CATCGACCCT GTCTGCATCT
AATTTTATTG GATCTTCAGG ATATTACTTT ACACCCTCAG GCAGCACCAC AGGAACTTCC
GCATATTTCT CGCTGACAAC CACGGCGACC GTCGGAAGCG GCCGCAGCTT CACCGCCATC
ATCGAAAATC CAACCTACTT TAATTCACCA GCACCGACGA GCCTCACCAG TTCAGCAGTC
TACAATTCGA ATGGCCAGGA TACCGGCGCG TCCGCAACAC TTAGTGCTGG CGGCGTCGAC
AAGGCAAACG GTTTGGGTGG AGCGAACTAT GACGTGAATT ACGCGAAGCT CGTCTTCTCG
AATGTCACCG TCACGGCCGG AGGGGGCGCA CCCGGGCCGA CAGGTGCCAG CGGCGCAACG
GGCGCAACGG GCGCCACTGG CGCAACGGGG AGCGGAGCCA CAGGCGCGAC AGGGGCCACC
GGCGCCACGG GCACGGGCGC GACCGGGCCG ACCGGGGCCG CGGGCTCCAC CGGCGCGACG
GGCACGGCCG GTGCGACGGG CGCCACGGGA GCGGGCGCCA CAGGCGCCAC AGGCGCGACC
GGTGCGACGG GAGACACGGG CCCCACGGGC GCGGGGGCGA CCGGAGCCAC AGGCGCGACG
GGCGCGGGGG CGACCGGCGC CACGGGATCG GATGGCGCGA CCGGCGCCGC GGGCGCGACC
GGTGCGACGG GATCCGCCGG TGCGACGGGG GCGACGGGCA CGGCGGGGAC CACGGGCGCG
ACCGGCGCCG CGGGCGCGGA TGGCGCCGGG GGCGCGACCG GCAGCACCGG CGCCGCGGGC
GCGACGGGTG CCACCGGCGC GGACGGCGCG AGGGGCGCCA CGGGCAGGGA CGGCGCCGCG
GGGGCGACCG GCGGCACAGG GGCCGCAGGC GCAACCGGCG GCACGGGAGC GGCCGGCAGC
ACCGGCGCCG CGGGCGCAAC GGGTGCCACC GGCGCAGATG GCGCGAGGGG CGCCACGGGC
AGGGACGGCG TTGCCGGGGC GACCGGCGGC ACGGGGGCTG CGGGTGCGAC CGGCGCCACG
GGCGCGGACG GCGCGAGGGG TGCCACGGGC AGGGACGGCG CCACCGGGGC GACCGGCAGC
ATGGGGGCTG CGGGTGCGAC CGGCGCGACC GGAATCGGTG CGACGGGGTC GGCCGGCAGC
ACCGGCGCCG CGGGCGCGAC GGGTGCCACC GGCGCGGACG GCGCGAGGGG CGCCACAGGC
AGGGACGGCG CCGCGGGGGC GACCGGCAGC ACGGGGGCTG CGGGTGCGAC CGGCGCGACC
GGAATCGGCG CGACGGGGTC GGCTGGCAGC ACCGGCGCCG CGGGCGCAGA TGGCGCGAGG
GGCGCCACGG GCAGGGACGG CGCTGCCGGG GCGACCGGCA GCACGGGGGC CGCAGGCGCA
ACCGGCGCCA CGGGCGCGGA CGGCGCGAGG GGTGCCACGG GCAGGGACGG CGCCACCGGG
GCGACCGGCA GCATGGGGGC TGCGGGTGCG ACCGGCGCGA CCGGAATCGG TGCGACGGGG
TCGGCCGGCA GCACCGGCGC CGCGGGCGCG ACGGGTGCCA CCGGCGCGGA CGGCGCGAGG
GGTGCCACGG GCAGGGACGG CGCTGCCGGG GCGACCGGCA GCACGGGCGC CGCGGGCGCG
ACCGGTCCCG CTGGGGCCAC GGGCGCCGGC GGCACGACCG GCGCGACGGG GGCGGCCGGG
GCGACGGGCA CCGCCGGAGC CACCGGCGCG ACCGGGAGCA CCGGACCGGC CGGGGCCACC
GGCGCCACCG GCCCCGAATG CTTCACCCAC GGCACCCGCC TGCTCACGCT CACGGGCGCG
CGCCGCGTCG AGGATCTCGC GGTCGGCGAC CGCCTCCTCA CCGCCGCCGG CGAGGCCCGG
CCGGTGGTCT GGATCGGGCG CCGCCGCCTG CGCCCCGACG CCCATCCCCG CCCCGACCGC
GTCCGCCCGG TGCGGATCCG GGCCGGGGCC CTGGCGCCGG GCCTGCCGGA GCGCGACCTC
CTGCTCTCGC CCGGCCACGG GGTGCTGTTC GCCGGCCACC TGATCCCGGC CGGCCTGCTG
GTCGACGGGC GCGGCGTGGC GGTGGAGGCC GTGGCCGAGG TGGAGTACCT GCACGTGGAA
CTCGACCTGC ACGACGTGGT CCTGGCCGAG GGCGTGCCCT GCGAGAGCTA CCTCGATGCC
GGGCAGCGGG CGGATTTCGA GGAGGCGGGC GGGGTGACGC GCCTGCACCC GGTCTTCCTG
CCGCTGACCT ACGAGGCGGC CTGCGCGCCG CTGGCGGTGG CCGGCCCGGT GCTGGCGGCG
GCGCGGGCGC AGATCGCGGC CCGGGCGGAG GCCCAAGCCG AGGCCGCAGC CCAAGCCGAG
GCCCAAGTCG GGGCCCAAGT CGAGGCCCAA GCCGAGGCCG CGGCGCGGGA GGGCGAGGCC
GCGCCGCGCC GGCGGCCGGC CGGGTGA
 
Protein sequence
MAYGTITVTA NFVQPLTNAV SNSGIVLLYD TTGTSTLSAS NFIGSSGYYF TPSGSTTGTS 
AYFSLTTTAT VGSGRSFTAI IENPTYFNSP APTSLTSSAV YNSNGQDTGA SATLSAGGVD
KANGLGGANY DVNYAKLVFS NVTVTAGGGA PGPTGASGAT GATGATGATG SGATGATGAT
GATGTGATGP TGAAGSTGAT GTAGATGATG AGATGATGAT GATGDTGPTG AGATGATGAT
GAGATGATGS DGATGAAGAT GATGSAGATG ATGTAGTTGA TGAAGADGAG GATGSTGAAG
ATGATGADGA RGATGRDGAA GATGGTGAAG ATGGTGAAGS TGAAGATGAT GADGARGATG
RDGVAGATGG TGAAGATGAT GADGARGATG RDGATGATGS MGAAGATGAT GIGATGSAGS
TGAAGATGAT GADGARGATG RDGAAGATGS TGAAGATGAT GIGATGSAGS TGAAGADGAR
GATGRDGAAG ATGSTGAAGA TGATGADGAR GATGRDGATG ATGSMGAAGA TGATGIGATG
SAGSTGAAGA TGATGADGAR GATGRDGAAG ATGSTGAAGA TGPAGATGAG GTTGATGAAG
ATGTAGATGA TGSTGPAGAT GATGPECFTH GTRLLTLTGA RRVEDLAVGD RLLTAAGEAR
PVVWIGRRRL RPDAHPRPDR VRPVRIRAGA LAPGLPERDL LLSPGHGVLF AGHLIPAGLL
VDGRGVAVEA VAEVEYLHVE LDLHDVVLAE GVPCESYLDA GQRADFEEAG GVTRLHPVFL
PLTYEAACAP LAVAGPVLAA ARAQIAARAE AQAEAAAQAE AQVGAQVEAQ AEAAAREGEA
APRRRPAG