Gene M446_3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3778 
Symbol 
ID6129123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4212229 
End bp4213275 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content73% 
IMG OID641643947 
Productribonuclease BN 
Protein accessionYP_001770591 
Protein GI170741936 
COG category[S] Function unknown 
COG ID[COG1295] Predicted membrane protein 
TIGRFAM ID[TIGR00765] YihY family protein (not ribonuclease BN) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.541742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGG ACGATGAGGG CGGGCAGGAT TGGCGCTCGG TGGGCACGCG CCGGCCGGAC 
GCGGCCGAGC GCGAGCGGGC CCTCGCCCGG GCCCGGGAGC CCGGCCGCGG CCGCACGGCC
GCGCAGCCCT CGGAGATCCC CTCGCGCGGC TGGGGCGACA TCCTCTGGCG CGTGGCCTGG
TCGGTGCCGC AGGACCGCGT GCTCGCGACC GCGGGCGGGG TCGCCTTCTT CGCCCTGCTC
GCCGTCTTCC CGGGGCTCGC CCTGATCGTG TCGCTCTACG GGCTCGTCGC GGATCCGGGC
GCCATCTACA AGCACCTGAG CCTGCTCACC GGGCTGCTGC CGCAGGCCGT CCTCGACCTC
CTCGCGGCCG AACTCAGCCG GGTGGCCGGC AAGAGCACGG GCGCGCTGGG CGCCGCCTCG
GGCGTGAGCC TGCTCGTCGC GTTCTGGAGC GCCAATTCGG GGGTGAGCGC CCTCTTCGAC
GCGCTCAACG TCATCTACAA GGAGCGGGAG AAGCGCCCGC TCCTGCACTT CTACGCCACC
ACCTTCCTGT TCACCCTCAC GGGCGTCCTG TTCGCGCTGG TGGCGACCGG CATGGTGGTG
GCCCTGCCGG TCGTGCTCAA CCAGGTCGGG TTCCAATCCT ACGCCAGCGA TGCGGCGCTG
CGCCTGCTGC GCTGGCCGGC GCTCCTCGTC CTGGTCAGCC TCGGCCTCGC GGCGATCTAC
CGCTACGGCC CGAGCCGGCA CGAGGCCAAG TGGCGCTGGG TGACCTGGGG CAGCGGCATC
GCCGCGCTGC TCTGGGTCTG CGCCTCGGCG CTCTTCTCCT GGTACGTGGC GCGCTTCGAC
AGCTACAACC GGATGTACGG GTCGCTCGGG GCCGGCGTCG GCTTCATGAC CTGGATCTGG
TTCTCCATCG TGATCGTGCT GCTCGGGGCC GAGCTGAACG CCGAGATGGA GCGGCAGACC
GTGCGCGACA GCACGACCGG CCGGCCGAAG CCCCTCGGCA CCCGCGACGC CCACGCGGCC
GACACGGTCG GCCCGAGCCA CGAGTGA
 
Protein sequence
MATDDEGGQD WRSVGTRRPD AAERERALAR AREPGRGRTA AQPSEIPSRG WGDILWRVAW 
SVPQDRVLAT AGGVAFFALL AVFPGLALIV SLYGLVADPG AIYKHLSLLT GLLPQAVLDL
LAAELSRVAG KSTGALGAAS GVSLLVAFWS ANSGVSALFD ALNVIYKERE KRPLLHFYAT
TFLFTLTGVL FALVATGMVV ALPVVLNQVG FQSYASDAAL RLLRWPALLV LVSLGLAAIY
RYGPSRHEAK WRWVTWGSGI AALLWVCASA LFSWYVARFD SYNRMYGSLG AGVGFMTWIW
FSIVIVLLGA ELNAEMERQT VRDSTTGRPK PLGTRDAHAA DTVGPSHE