Gene M446_4224 details

Gene Information

Locus tag: M446_4224
Symbol:
ID: 6135644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4662370
End bp: 4663560
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 79%
IMG OID: 641644368
Product: protein of unknown function DUF610 YibQ
Protein accession: YP_001771007
Protein GI: 170742352
COG category: [S] Function unknown
COG ID: [COG2861] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
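
The start, end, gene length and protein length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4662370
end_bp = 4663560

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the final codon is the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1191 0 396
```

Under these assumptions the result agrees with the listed Gene Length (1191 bp) and Protein Length (396 aa).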


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.658046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0567423
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCTCG CCAACGATGC CCTCACCCGA CCCCTCGGGA TGCGGGACGA GACCCCCTCC 
CGCCTCGCGC GCCTGCGGGC CGCCATCCCG CTGCGGGCGA TCCTGTCGGC CGCGGCGGCC
GTCCTGGTCG GGGCCGTCGC CGGGTTCGTC GCCCTGACGG AGGATCCCCT CGGCGGCGAG
CCGCACGCCC TGGTGACGAT CACCCGGCGC GAGGCCCCGC CCGGCCCGCC GAGCCCGCGC
CCGCCCGCCC CGGCGGAGGC CGGGAGCCGC AGCGCGAGCG AGGTCGAGCG CGCCTCGGGC
GTTGCGGTGC TGCGGCCGGA GGGCAGTGCC GTGCCGGATT CGGTGGTGAT CCGCGTGCCG
GGGCCGACCG AGATCCGCCT CGCCCCCGCG CCCGATCCGG CCCTCTCCGA GAAGGGCCGC
TACGGGCTGC TGCCGCGCCT CGGCCCGGAC GGCGCCCGCG CCGTCGACGT CTACGCCCGG
CCGGAGGCGC CCGCCCTGCC GAGCGGCGCG GCGCCGGCCG GCCGGATCGC CCTGGTGGTG
ACCGGCCTCG GGATCGGGGC GGCGGTGACG CAGGACGCGG TGACCCGGCT GCCGCCCGCC
GTCACCCTCG CCTTCGCCCC CTACGGCGCC GACGTGGGCC GGCAGGCGGC CCGCGCCCGC
GAGGCCGGCC ACGAGGTGAT GGTGCAGGCC CCGATGGAGC CCTTCGACTA CCCGGACAAC
GATCCCGGGC CGCAGACGCT GCTCGCGGGC GCCAAGCCCG CCGAGAACGC GGACCGCCTC
GGCTTCGTGC TCTCCCGCAT CCCGGGCGCG ATCGGGGTGG TGAACTACAT GGGGGCGCGG
CTCACCGCCG AGGCCGGGGC CCTCGACCCG ATCCTGCGCG AGATCGGGGC CCGCGGCCTC
GGCTTCGTCG ACGACGGCAC CTCGCCGCGC TCGCTCGCGC TCGACATCGG GCGGCGCGCC
CGCGCGCCGG TGGCCCGCGC CGACGTGGTC GTGGACGCGG CGCCGCTCCC CGACGCGGTC
GACCGCGAAC TCGCCCGGCT GGAGGAGACG GCGCGGCGCA AGGGCTTCGC GATGGGTTCG
GCCATGGCGC TGCCGCTCAC CATCGACCGG ATCGCCCGCT GGAGCCGCGA CCTGGAGGCG
CGGGGCATCC TGCTCGTCCC GGCGAGCCGC GCCCTCCGGG CGCGTCGGTA G
 
Protein sequence
MILANDALTR PLGMRDETPS RLARLRAAIP LRAILSAAAA VLVGAVAGFV ALTEDPLGGE 
PHALVTITRR EAPPGPPSPR PPAPAEAGSR SASEVERASG VAVLRPEGSA VPDSVVIRVP
GPTEIRLAPA PDPALSEKGR YGLLPRLGPD GARAVDVYAR PEAPALPSGA APAGRIALVV
TGLGIGAAVT QDAVTRLPPA VTLAFAPYGA DVGRQAARAR EAGHEVMVQA PMEPFDYPDN
DPGPQTLLAG AKPAENADRL GFVLSRIPGA IGVVNYMGAR LTAEAGALDP ILREIGARGL
GFVDDGTSPR SLALDIGRRA RAPVARADVV VDAAPLPDAV DRELARLEET ARRKGFAMGS
AMALPLTIDR IARWSRDLEA RGILLVPASR ALRARR
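
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and that the first codon (GTG here) is a start codon and is therefore read as methionine. The function name `translate_cds` and the variable `first_block` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI base order (T, C, A, G). Assumption: table 11
# shares these assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon ends translation
        # Assumption: the first codon is a start codon and is read as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_block = "GTGATCCTCG CCAACGATGC CCTCACCCGA CCCCTCGGGA TGCGGGACGA GACCCCCTCC"
print(translate_cds(first_block))  # MILANDALTRPLGMRDETPS
```

The output matches the first 20 residues of the protein sequence above.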