Gene M446_5446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5446 
Symbol 
ID6132568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5979329 
End bp5980513 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content76% 
IMG OID641645580 
ProductVWA containing CoxE family protein 
Protein accessionYP_001772196 
Protein GI170743541 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.337348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCC GGCTGCCGGA CAACATCGCC CACTTCGCCC GCGCCCTGCG CAGGGCCGGG 
CTGCCGGTCG GGCCCGGGGA CGTCATCGAC GCGGTCGAGG CCGTGGTGGC GGCCCGGATT
GGCAGCCGCG AGGACCTCTA CTGGACGCTC CACGCCGTCC TGGTGCGCAA GCACGAGCAT
TCGGTGCTGT TCGAGGAGGC GTTCCGCCTG TTCTGGCGGC GGCGCGACCT CATCGAGGCG
CTGATCGCCC AGATGGCGCC GGTCGCGCCG GCCCGGGGGC GCGAGCCGCC GAAGGCCGGG
GCCCTGCGGG TGCGCGAGGC GCTCGCGCCC GACGCCCCCC GGCCCCGGCC GAAGCAGGAC
GAGGCGGCCC TGCGCGCCAC CCTCACGGTC TCCGAGCGCG AGGTGCTGAA GGCCAAGGAC
TTCGCCCAGA TGTCGACCGA GGAGGTGGCG CAGGCCCGCC GCCTGATCGC CGCCCTGGCG
CTGCCCGACG ACGCCGCCCG CACCCGGCGC TTCGCCCCGA CGGCGCGCGG GCGGATCGAC
CCGCGGCGCA GCTTCCAGCG CACCCTGCGC GCCCAGGGAA GCATCGACCT CGCGTTCCGG
TCGCCGCGGG TGCGGCCGCC GCCGATCGTC GCCCTCGCCG ACATCTCGGG ATCGATGGCG
GAGTACTCGC GCCTCTTCCT CCACTTCCTG CACGCCCTCG GCGAGCGGCG CCGGGTGCAC
AGCTTCGTCT TCGCGACGCG GCTCACCAAC ATCACCCGCG AACTCGCCCG GCGCGACCCG
GACGAGGCGC TCGTCCGGGC GAGCGCCCGG GCCCGCGACT GGGAGGGGGG CACCCGCATC
GCCGCGGCCC TGCACGCCTT CAACCGGCAC TGGTCCCGCC GGGTGCTCGG CGGGGGCGCG
GTGGTGCTGC TGTTCACGGA CGGGCTGGAG CGCGAGGTGA CGCCGGAACT GACCTTCGAG
ATGGACCGGC TGAAGCGCTC CTGCCGCCGC CTCGTCTGGC TGAACCCGCT CCTGCGCTTC
GACCGCTTCG AGGCCCGGGC GGCCGGCATC CGGGCGATGC TGCCCCACGT CCACGATTTC
CGGCCGATCC ACAGCCTCGC GGCGATGGAG GATCTCTGCC GCGCGCTCGG CGCAGGGCCG
GTCCGCGCCG ACGACCCGCG CGCTTGGTTG CGCCGGGCCG GCTGA
 
Protein sequence
MTGRLPDNIA HFARALRRAG LPVGPGDVID AVEAVVAARI GSREDLYWTL HAVLVRKHEH 
SVLFEEAFRL FWRRRDLIEA LIAQMAPVAP ARGREPPKAG ALRVREALAP DAPRPRPKQD
EAALRATLTV SEREVLKAKD FAQMSTEEVA QARRLIAALA LPDDAARTRR FAPTARGRID
PRRSFQRTLR AQGSIDLAFR SPRVRPPPIV ALADISGSMA EYSRLFLHFL HALGERRRVH
SFVFATRLTN ITRELARRDP DEALVRASAR ARDWEGGTRI AAALHAFNRH WSRRVLGGGA
VVLLFTDGLE REVTPELTFE MDRLKRSCRR LVWLNPLLRF DRFEARAAGI RAMLPHVHDF
RPIHSLAAME DLCRALGAGP VRADDPRAWL RRAG